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Abstract. We continue our analysis of the number partitioning problem with 
n weights chosen i.i.d. from some fixed probability distribution with density 
p. In Part I of this work, we established the so-called local REM conjecture of 
Bauke, Franz and Mertens. Namely, we showed that, as n — » oo, the suitably 
resealed energy spectrum above some fixed scale cx tends to a Poisson process 
with density one, and the partitions corresponding to these energies become 
asymptotically uncorrelated. In this part, we analyze the number partitioning 
problem for energy scales a„ that grow with n, and show that the local REM 
conjecture holds as long as n' -^I^OLn 0, and fails if a„ grows like nn^l^ with 
re > 0. 

We also consider the SK-spin glass model, and show that it has an analogous 
threshold: the local REM conjecture holds for energies of order o(n), and fails 
if the energies grow like nn with k > 0. 



1. Introduction 

1.1. Number Partitioning. In this paper we continue the study of the energy 
spectrum of the number partition problem (Npp) with randomly chosen weights. 
We refer the reader to [BCMNOS] for a detailed motivation of this study, but for 
completeness, we repeat the main definitions. 

We consider random instances of the Npp with weights Xi , . . . , X„ € M taken to 
be independently and identically distributed according to some density with 
finite second moment (for simplicity of notation, wc will choose the second moment 
to be one). Given these weights, one seeks a partition of these numbers into two 
subsets such that the sum of numbers in one subset is as close as possible to the 
sum of numbers in the other subset. Each of the 2" partitions can be encoded as 
(T G { — 1,+1}", where (Ji = 1 if Xi is put in one subset and di = — 1 if is put in 
the other subset; in the physics literature, such partitions cr are identified with Ising 
spin configurations. The cost function to be minimized over all spin configurations 
cr is the energy 



E{a) ^ ^ 
\ n 



(1.1) 



where, as in |BCMN0'5] . we have inserted a factor l/\/n to simplify the equations 
in the rest of the paper. 

Note that this scaling implies that the typical energies are of order one, and the 
maximal energies are of order ^Jn. Indeed, if cr is chosen uniformly at random and 
Xi, . . . , Xn are i.i.d. with second moment one, the random variable (TiXi 
converges to a standard normal as n ^ oo, implying in particular that for a typical 
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configuration, E{cr) is of order one. The maximal energy, on tlie otlier liand, is 
equal to n~^/^ By the law of large numbers, this implies that the maximal 

energy is asymptotically equal to ^/n times the expectation of \X\. 

As usual, the correlation between two different partitions cr and cr is measured 
by the overlap between cr and cr, defined as 

1 " 

q{(T,a) = -S^ak^k- (1.2) 

k=l 

Note that the spin configurations cr and — cr correspond to the same partition 
and therefore of course have the same energy. Thus there are N = 2"~^ distinct 
partitions and (with probability one) also N distinct energies. The energy spectrum 
is the sorted increasing sequence E'l, ■■■,En of the energy values corresponding to 
these N distinct partitions. Taking into account that, for each i, there are two 
configurations with energy Ei, we define cr^*) to be the random variable which is 
equal to one of these two configurations with probability 1/2, and equal to the other 
with probability 1/2. Then the overlap between the configurations corresponding 
to i^^ and j'^ energies is the random variable q{(T^'^\ cr^^^). 

As noted in .BCMN05) . neither the distribution of the energies, nor the distri- 
bution of the overlaps changes if one replaces the density p{X) by the symmetrized 
density ^{p{X)+ p{—X)). We may therefore assume without loss of generality that 
p{X) = p{—X). Under this assumption, it is easy to see that the energies E^er) for 
the different configurations cr are identically distributed. Let us stress, however, 
that these energies are not independently distributed; the energies between differ- 
ent configurations are correlated random variables. Indeed, there are A'' = 2""^ 
energies, Ei, . . . En, constructed from only n independent variables Xi, . . . A"„. 

Consider now a very simple model, the so-called random energy model (REM) 
first introduced by Derrida jPerSH in a different context. The defining property 
of the REM is that A^ energies E{(t) are taken to be independent, identically dis- 
tributed random variables. In the REM, the asymptotic energy spectrum for large 
A' can be easily determined with the help of large order statistics: if the energies 
are ordered in increasing order and a > is any fixed energy scale, the suitably 
rescaled energy spectrum above a converges to a Poisson process. More precisely, 
if the distribution of E((t) has a non-vanishing, continuous density g{a) at a and 
Er+i is the first energy above a, then the rescaled energies (E'r+i ~ c()Ng{a), 
{Er+2 — a)Ng{a), . . . converge to a Poisson process with density one. 

In spite of the correlations between the energies E{(t) in the Npp, it had been 
conjectured [MerOOl iBFMOl) that as n ^ oo, the energy spectrum above any fixed 
energy a behaves asymptotically like the energy spectrum of the REM, in the sense 
that the suitably rescaled spectrum above a again becomes a Poisson process. 
In |BFM04| it was also conjectured that the overlaps corresponding to adjacent 
energies are asymptotically uncorrelated, so that the suitably normalized overlaps 
converges to a standard normal. These two, at first sight highly speculative, claims 
were collectively called the local REM conjecture. This conjecture was supported 
by detailed simulations. 

In Part I of this paper [BCMNOS] . we proved the local REM conjecture for the 
Npp with a distribution p that has finite second moment and lies in L^+^ for some 
e > 0. More precisely, under these conditions, we proved that for all i ^ j, the 
suitably normalized overlap between the configurations corresponding to the i"^ and 
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j energy above a becomes asymptotically normal, and that the rescaled energies 
{Er+i — a)£^~^, [Er+2 — a)C7"^i • • ■ '^ith rescaling factor 



converge to a Poisson process with density one. Recalling that the normalization 
in corresponds to typical energies of order one, this this establishes the local 
REM conjecture for typical energies. 

In |BFM04| . the authors expressed the belief that the weak convergence of the 
rescaled energies to a Poisson process should extend to values of a that grow slowly 
enough with n, although computational limitations prevented them from supporting 
this stronger claim by simulations. At first, one might think that the local REM 
conjecture could hold for a = o{^/n). Indeed, since the maximal energy is of order 
it is clear that the conjecture is false for a ~ c^fn with large enough c. But if 
this were the only obstruction, then one might hope that the conjecture could hold 
up to a = o{ypri). As we will see in this paper, this is not the case; the conjecture 
will only hold for a = o(n^^'^). 

Before stating this precisely, let us be a little more careful with the scaling of the 
energy spectrum. Note that the scaling factor p. 3(1 is equal to [Ng{a)]~^ , where 
N ~ 2"^^ is the number of energies, and g{a) ~ ■\/2/7re~""/^ is the density of the 
absolute value of a standard normal, in accordance with the expected asymptotic 
density of Eicr) according to the local limit theorem. But it is well know that, in 
general, the local limit theorem does not hold in the tails of the distribution. For 
growing q;„, the REM conjecture should therefore be stated with a scaling factor 
that is equal to the inverse of 2""-'^ multiplied by the density of the energy E{cr) at 
a. We call the REM conjecture with this scaling the modified REM conjecture. It is 
this modified REM conjecture that one might naively expect to hold for a = o{y/n). 

It turns out, however, that at least for the Npp, this distinction docs not make 
much of a difference. For a = o(n^/'*), the original and the modified conjectures 
are equivalent, and the original REM conjecture holds, while for a growing like a 
constant times n^/'*, both the original and the modified REM conjectures fail. So 
for the Npp, the threshold for the validity of the REM conjecture is n^/"', not y/n 
as one might have naively guessed. 

1.2. The SK Spin Glass. In a follow-up paper to |BFM04| . Bauke and Mertens 
generalized the local REM conjecture for the Npp to a large class of disordered 
systems, including many other models of combinatorial optimization as well as 
several spin glass models |BM04| . Motivated by this conjecture, Bovier and Kurkova 
developed an axiomatic approach to Poisson convergence, covering, in particular 
many types of spin glasses like the Edwards- Anderson model and the Sherrington- 
Kirkpatrick model. 

The Shcrrington-Kirkpatrick model (SK model) is defined by the energy function 



As before, cr is a spin configuration in { — but now the random input is 
given in terms of random variables Xij with i,j G {1, . . . ,n}, usually taken to 
be i.i.d. standard normals. Again E{cr) = E{—(t), leading to = 2"^^ a priori 




(1.3) 




(1.4) 



4 CHRISTIAN BORGS\ JENNIFER CHAYES\ STEPHAN MERTENS^, CHANDRA NAIR^ 



different energies Ei < E2 < ■ ■ ■ < En- Note, however, that the normahzation in 
H1.4(l corresponds to typical energies of order ^/n and maximal energies of order n, 
in accordance with the standard physics notation. 

Consider an energy scale q;„ > 0, and let Er+i be the first energy above a„. 
To obtain the REM approximation for the SK model, we observe that the random 
variable E{cr) is a Gaussian with density 

~g{x) = -^e-^'/^-. (1.5) 

The REM approximation for the SK model therefore suggests that the rescaled 
energy spectrum (-E^+i — ctn)S,n^ , {Er+2 ~ "n)Cn ^1 ■ • ■ '^ith rescaling factor 

|„ = \/2^2-("-i)e^ (1.6) 

converges to a Poisson process with density one. Recalling that typical energies are 
now of order ^/n, the local REM conjecture for typical energies in the SK model thus 
claims that for a„ = 0{^/n), the rescaled energy spectrum converges to a Poisson 
process with density one, with overlaps which again tend to zero as n ^ 00. 

This conjecture was proved in a very nice paper by Bovier and Kurkova jBKOSaj . 
In fact, they proved that the conjecture remains valid as long as q;„ = 0{n^) 
with ry < 1. To get some insight into still faster growing a„, Bovier and Kurkova 
then considered the generalized random energy model (GREM) of Derrida )Uer85j . 
For this model, they proved |BK05bj that the local REM conjecture holds up to 
Q^n = (3on, where /3o is the inverse transition temperature of the GREM, and fails 
once a„ exceeds this threshold. Based on these results for the GREM, Bovier 
and Kurkova then suggested |BK05cj that the f3o might be the threshold for other 
disordered spin systems as will. 

As we will show in this paper, this is not the case, at least not for the SK model, 
for which we prove that the REM conjecture holds up to the threshold = o(n), 
and becomes invalid as soon as limsupa„/n > 0, see Theorem 12.21 below for the 
precise statement. Thus even the scaling with n of the threshold does not obey the 
naive expectation derived from the GREM. Note that for the SK model there is no 
difference between the original REM conjecture and the modified REM conjecture, 
since the density of E{a) is Gaussian for all energy scales. 

1.3. Organization of the Paper. This paper is organized as follows. In the next 
section, we precisely state the assumptions on our model and formulate our main 
results, see Theorems 12.11 and 12.21 In Sectional we then describe our main proof 
strategy for the Npp. Since the proof strategy for the SK model only requires 
minor modifications (the proof is, in fact, much easier), we defer the discussion 
of this model to the last subsection. Section 13.71 The next four sections contain 
the details of the proof: As a warmup, we start with the Npp with Gaussian 
noise, where our strategy is most straightforward. Next, in Sectional we move 
to the Npp with a general distribution. This section contains the meat of our 
proof: the establishment of a large deviations estimate for the probability density 
of several (weakly dependent) random variables. In Section we give the proof of 
Theorem 12.21 and in Section [3 we establish several auxiliary results needed in the 
rest of the paper. We conclude the paper with a section summarizing our results 
and discussing possible extensions. Section |H1 
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2. Statement of Results 

2.1. Number Partitioning. Let Xi, ...,Xn be independent random variables dis- 
tributed according to the common density function p{x). We assume that p has 
second moment one and satisfies the bound 

|l+c 



dx < oo (2.1) 

for some e > 0. Since neither the distribution of the overlaps nor the energy 
spectrum changes if we replace p{x) by ^{p{x) + p{~x)), we will assume that p{x) = 
p{~x). Wc use the symbol P„(-) to denote the probability with respect to the joint 
probability distribution of Xi, ...,X„, and the symbol E„(-) to denote expectations 
with respect to Pri(-)- 

As in the introduction, we represent the 2" partitions of the integers {1, .., n} as 
spin configurations cr G {—1,-1-1}" and define the energy of cr as in (|1.1() . Recalling 
that the distribution of E{a) docs not depend on cr, let be the density of 

E{(t), and let ^„ be the modified scaling factor 

^ r>„_i 7 \- (2-2) 

We now introduce a continuous time process {Nn{t) : t > 0} where Nn{t) is defined 
as the number of points in the energy spectrum that lie between a„ and a„ -I- i^„. 

Let El, ... , Epf be the increasing spectrum of the energy values corresponding 
to the N = 2"^^ distinct partitions. Given a„ > 0, let r„ be the random variable 
defined by £'r„ < a„ < E^^+i- For j > i > 0, we then define the rescaled overlap 
Qij as the random variable 

Q^^ = iE"'^'9('^» (2-3) 

where the sum goes over the four pairs of configurations with E{a) = Er^+i and 
E{a) = Er^+3- Instead of the overlap Qij, we will sometimes consider the following 
variant: consider two distinct configurations cr and cr chosen uniformly at random 
from all pairs of distinct configurations. We then define Qn,t as the rcscalcd overlap 
n}/'^q{<j, cr) conditioned on the event that E{(t) and E{a) fall into the energy 
interval [a„,Q;„ -|- tS^n]- We will refer to Qn,t as the overlap between two typical 
configurations contributing to iV„ (i) . 

The main results of this paper are statements ii) and iii) of the following theorem. 
The first statement is a corollary of the proof of ii) and iii) and implies that for 
a„ ~ o{n^/^), the original and the modified REM conjecture are equivalent. 

Theorem 2.1. Let Xi, ...,X„ g R &e i.i.d. random variables drawn from a prob- 
ability distribution with second moment one and even density p. If p obeys the 
assumption (|2.1|) for some e > and has a Fourier transform that is analytic in a 
neighborhood of zero, then the following holds: 

i) Let gn{-) be the density of E{<t), and let g(a) = ■\/2/7re^" //a„ = o{n^^^), 
then 5„(a„) = g(a„)(l -l- o(l)). 

ii) Let Un = o{n^^^), and let j > i > be arbitrary integers not depending on n. 
As n — !■ oo, the process N„(t) converges weakly to a Poisson process with density 
one, and both Qij and Qn,t converge in distribution to standard normals. 
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in) Let an = nn^^'^ for some finite k > 0. Then E„[iV„(t)] = t + o(l), hut the 
process Nn(t) does not converge to a Poisson process, and Qn,t does not converge 
to a standard normal. 

In order to prove the above theorem, we will analyze the factorial moments of 
the process Zn{t). We will show that for a„ = o{n^/'^), they converge to those 
of a Poisson process, and for a„ ~ kv}!^ with k > 0, they do not converge to 
the moments of a Poisson process. Together with suitable upper bounds on the 
moments of Zn(t), this allows us to establish non-convergence to Poisson for k > 0, 
but, unfortunately, it does not allow us to establish convergence to some other 
distribution. 

The situation is slightly more "constructive" for the overlap distribution: here 
we are able to determine the limiting distribution of Qn,t for = nn^^'^ with 
K > 0. In this regime, the distribution of Qn,t converges to a convex combination of 
two shifted Gaussians: with probability 1/2 a Gaussian with mean and variance 
one, and with probability 1/2 a Gaussian with mean —k^ and variance one, so 
in particular Qn,t is not asymptotically normal. As we will see, this is closely 
connected to the failure of Poisson convergence; see Remark |4.2l in Section 01 and 
Remark Is .81 in Section O below. 

2.2. SK Spin Glass. We consider the SK model with energies given by H1.4|) and 
random coupling Xy which are i.i.d. standard normals. Let N ~ 2"^^, and let 
El, ... , En and (t^^\ . . . , cr*^^^ be as defined in the introduction. 

Given an energy scale q;„ > and two integers j > z > 0, we again introduce 
Qij as the random variable defined in H2.3|l . with r„ given by the condition that 
£'r„ < oin < Er^+i. Finally, we define Nn(t) to be the number of points in the 
energy spectrum of l|1.4|l that lie between «„ and + 1^„, with £^n given by 1)1. 6|l . 
We say that the local REM conjecture holds if Nn{t) converges weakly to a Poisson 
process with density one, and Qij converges in distribution to a standard normal 
for aU j > i > 0. 

Our proofs for the Npp can then be easily generalized to give the following 
theorem. 

Theorem 2.2. There exists a constant eq > such the following statements hold 
for all sequences of positive real numbers an with a„ < egn: 

(i) E(7V„(t)) ^ t as n oo. 

(ii) The local REM conjecture for the SK model holds if and only if an = o(n). 

3. Proof Strategy 

In this section we describe our proof strategy for Theorems 12.11 and 12.21 We 
explain our ideas using the example of the Npp, referring to the SK model only in 
the last subsection. Section 1X71 

3.1. Factorial moments. Consider a finite family of non-overlapping intervals 
[ci,di], . . . , [cmjrfm] with de > ce > 0, and let ji = dg — ci. Weak convergence 
of the process {Nn{t): t > 0} to a Poisson process of density one is equivalent 
to the statement that for each such family, the increments iV„ (c?i ) — iV„ (ci ) , . . . , 
Nn{dm) — Nn{cm) Converge to independent Poisson random variables with rates 

71, ■•■,7m- 
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Let Zn{a,b) be the number of point in the energy spectrum that fall into the 
interval [a, b]. We then rewrite Nn{di) — Nn{ci) as Z„(a^, 6^), where af^ = a + 
and 6^ = a + c?f^„, with ^„ defined in equation H2.2|) . Wc prove convergence of 
the increments to independent Poisson random variables by proving convergence of 
the multidimensional factorial moments, i.e., by proving the following theorem. To 
simplify our notation, wc will henceforth drop the index n on both the symbol E„ 
and P„. 

Theorem 3.1. Let p be as in Theorem \2.1\ let = 0(71^/"*) be a sequence of 
positive real numbers, let m be a positive integer, let [ci,di\, [c,„,dm] be a 

family of non- overlapping, non-empty intervals, and let (ki,..,km) be an arbitrary 
m-tuple of positive integers. For £ = 1, . . . ,m, set = q;„ + Cf^„, b^ — an + de£,n, 
and "f{ = d( — ci. Under these conditions, we have 

m m 

hm E[l[{Z,,{albi)),,]^l[^^', (3.1) 

£=1 e=i 
where, as usual, [Z)k = Z{Z — l)...[Z — k + l). 

Theorem |0] establishes that {Nn{t) : <: > 0} converges to a Poisson process with 
density one if = o(n^/^). The convergence of the overlaps is an easy corollary to 
the proof of this theorem. The details are exactly the same as in jBCMNOS] , and 
will not be repeated here. 

The failure of Poisson convergence for faster growing a„ is easiest to explain for 
the case in which Xi, . . . ,X„ are standard normals, since this does not require us 
to distinguish between the original and the modified REM conjectures. Our proof 
is again based on the analysis of the factorial moments of Nn{t). 

More precisely, we will show that E[Nn{t)] converges to t, while the second 
factorial moment, E[{Nn{t))2] does not converge to t'^. Note that this fact by itself 
is not enough to exclude convergence to a Poisson process since convergence of the 
factorial moments is, in general, only a sufficient condition for weak convergence to 
a Poisson process. But combined with suitable estimates on the growth of the third 
moment, the fact that E[(A^„(t))2] does not converge to t'^ is enough. This follows 
from the following lemma, which is an easy consequence of a standard theorem on 
uniformly integrable sequences of random variables (see, e.g.. Theorem 25.12 and 
its corollary in |Bi94| '). 

Lemma 3.2. Let Zn > be a sequence of random variables such that K[Zl2 is 
bounded uniformly in n for some r < 00. If Zn converges weakly to a Poisson 
random variable with rate 7 > 0, then lim„_>oo ]E[(Z„)fc] = for all k < r. 

Combined with this lemma, the next theorem establishes the third statement of 
Theorem 12.11 if the weights Xi, . . . , X„ are Gaussian. The proof for non-Gaussian 
weights will be given in Section [5.41 

Theorem 3.3. Let Xi, Xn G M 6e i.i.d. random variables with normal distribu- 
tion, and let an = o(y^). 

(i) E[Nn{t)] =t + o{l) for all fixed t > 0. 

(a) Let m. On, bn, •fg, and ki, .., km be as in Theorem \3.1\ For k = — 2; 

we then have 

m m 

EllliZniaiyn)),,] = (n7,^-')e^<e°(""""^)+°(^). (3.2) 
e=i 1=1 
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Theorem 13.31 will be proved in Section 0] and Theorem 13.11 will be proved in 
Section In the remainder of this section, we will map out the general proof 
strategy, and in the process, establish several properties which will be used in the 
proofs of Theorems 13. II and 13.31 

3.2. Analysis of first moment. In order to analyze the first moment we rewrite 
Zn{ai,bi) as 

Z„(a^,6l)=^/W(<T) (3.3) 

where I^^'>{cr) is 1/2 times an indicator function of the event that the energy E{(t) 
falls into the interval [af^, bf^] (the factor 1/2 compensates for the fact that the sum 
in H3.3|l goes over all configurations cr e { — 1,+1}", and therefore counts every 
distinct partition twice). Taking expectations of both sides we see that 

E[Z„(al,6l)] = i^p(i?(^) e [albi]). (3.4) 

cr 

Since we have taken p{x) = p{—x), the distribution of E{(j) is identical for all cr, so 
that the probability on the right hand side is independent of cr. In order to prove 
that E[Z„(a^, 6^)] = 7^(1 + o(l)), we will therefore want to prove that 

v(E{a) G ==2-("-i)7.(l + o(l)). (3.5) 

Rewriting the left hand side as 

v(e{<t) e [al = jj gn{y)dy, (3.6) 

where gn{') is the density of E(a) with respect to the Lesbesgue measure on = 
{x G M.: X > 0}, and recalling the definition ^„ — [2"~^(7„(a„)]~^, we see that the 
bound (|3.5() is equivalent to the statement that g„(i/) = 5n(an)(l + o(l)) whenever 
y = a„ + 0(^„). 

This bound is easily established in the Gaussian case, where it holds as long as 
ctn < Cyjn for some c < V21og2. Indeed, let us write £^(cr) as |iJ(cr)|, where 

1 " 

i/(<T) = — ^a,X,. (3.7) 

If the random variables . . . , X„ are standard Gaussians, the random variables 
H{(t) are standard Gaussians as well, implying that E{(t) has density 

g{y) = (3.8) 

with respect to the Lesbesgue measure on = {a; e M: a; > 0}. If < c-y/n 
for some c < -\/21og2, then ^„ — o(e~'^") for some e > 0, implying that for y ~ 
an + 0{£,n), we have g{y) = g{an){l + 0{y/n£,n)) = g(a„)(l + o(l)), as desired. This 
proves (|3.5|) and hence the bound E[Zn{afi, 6^)] = 7^(1 + o(l)) provided a„ < c-^/n 
for some c < -^2 log 2. Note that this already establishes the first moment bound 
stated in Theorem 13. 31 

For more general distributions, the proof is more complicated since 5„(a„) is no 
longer given by a simple formula like H3.8|l . But given our assumptions on p, we 
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will be able to show that under the assumption that an = o{^/n), it can be written 
in the form 

ff"K) = y|e-"^(""""'')(l + o(l)) (3.9) 

where G is an even function which is analytic in a neighborhood of zero and satisfies 
the bound 

G(2;) = y + 0(a-4). (3.10) 

The proof of the representation H3.9|l involves an integral representation for gn 
combined with a steepest descent analysis and will be given in Section [Q 

The bounds H3.9(l and (|3.1()(l have several immediate consequences. First, they 
clearly imply that .g„(a„) = g{an){i + o(l)) if a„ = o{n'^/'^), proving the first 
statement of Theorem 12.11 Second, they imply that ^„ decays exponentially in n 
if an = o(-y/n). Using the bounds (|3.9(l and H3.10(l once more, wc conclude that 
9n{y) = gn(an)(l + o(l) + O(an60) = 5n(an)(l + o(l)) whenever y = a„ + 0(^„). 
For an = o(y^), we therefore get convergence of the first moment, E[Z„(a^, bf^)] = 
7^(1 + 0(1)), implying in particular that E[Z„(i)] = t + o(l) as claimed in the last 
statement of Theorem 12. II 

3.3. Factorial moments as sums over pairwise distinct configurations. 

Next we turn to higher factorial moments. Before studying these factorial mo- 
ments, let us consider the standard moments E[J|^-^(Z„(a^, 6^))*''']. In view of 
(|3.3|) . these moments can be written as a sum over k configurations cr^^\ . . . , cr^*"'', 
where k ~ J2T=i ^e- As already observed in |BCP01| . the factorial moments can 
be expressed in a similar way, the only difference being that the sum over con- 
figurations is now a sum over pairwise distinct configurations, i.e., configurations 
cr(i), . . . , a-^''') e {-1, +1}" such that (t(*) ^ ±cr(j) for all i ^ j. Explicitly, 



m K 



(3.11) 

i?(<T«) e [al«,6l«] forallj-1,..., 



= - y 

cr(l),...,cr(^-): 

where the sums run over pairwise distinct configurations and = 1 if J = 
1, . . . , /fci, = 2 if j = fci + 1, . . . , fci + /ca, and so on. See |BCMN05j for the 
(straightforward) derivation of (|3.11|) . 

In order to prove convergence of the higher factorial moments, we therefore would 
like to show that the probability 

S(cr«) G K^^^fe^^^)] for aUj 

asymptotically factors into the product Hj P(^('''^''') ^ [an^\bn^^]). Unfortu- 
nately, this asymptotic factorization does not hold for arbitrary families of distinct 
configurations cr^^^ . . . cr^^'^ . This problem is already present for a„ that are bounded 
as n ^ 00 (see |BCMN0'5] '). and - in a milder form ~ it is even present for the special 
case of a„ = treated in |BCP01j . 
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3.4. Typical and atypical configurations. As in |BCMN05j and jBCPOlj . it 

is useful to distinguish between "typical" and "atypical" sets of configurations 
(t(^), crC') when analyzing the right hand side of H3.11|l . To this end, we con- 
sider the matrix M formed by the row vectors cr^^), . . . , cr*^*^^. Given a vector 
S e { — 1, 1}'^, we then define ns{cr'-^\ ...jCr^*^)) as the number of times the column 
vector S appears in the matrix M, 

ns = n,(<T(i), <tW) = \{j < n : af) = S}\. (3.12) 

If (T^^\ . . . , crC^' e {^1, +1}" are chosen independently and uniformly at random, 
then the expectation of ns is equal to n2~*'' for all i5 G {— Ij+l}*^. By a standard 
martingale argument, for most configurations, the difference between ns and n2~'^ 
is therefore not much larger than ^/n. More precisely, for any An — > cx) as n — > cx), 
all but a vanishing fraction of the configurations cr^^\ . . . , crC^' obey the condition 

max|n5((T(i),...,cr('^)) - ^| < x/^A„n, (3.13) 

see Lemma 3.9 in IBCMNOS] for a proof. 

The proof of Theorems l3.1l and l3.3l now proceeds in two steps. First, we show that 
the contribution of the configurations that violate H3.13(l is negligible as n — > oo, and 
second we analyze the configurations satisfying (|3.13|) . It turns out that first part is 
quite complicated and requires distinguishing several sub-classes of configurations, 
but this analysis has already been carried out in (BCMNO's] . resulting in bounds 
that are sharp enough for growing a„ as well. So the only additional work needed 
is a sharp analysis of the typical configurations. 

The next lemma summarizes the main results from |BCMN0'5] needed in this 
paper. To state it, we define Rn,ki^n) as 

i?„.fe(A„) = ^ Y.' e for aU J = 1, . . . , fc) , (3.14) 

where the sum runs over pairwise distinct configurations cr^^-' , . . . , cr^*^^ that are 
either linearly dependent or violate the bound H3.13|l . Wc also use the notation 

g„,ax = max|g(cr«,,T(-''))| (3.15) 

for the maximal off-diagonal overlap of cr'^', . . . , crC^'. 

Lemma 3.4. Let A„ be a sequence of positive numbers. 

(1) Then the number of configurations cr^^^, cr^*^) that violate the condition 

is bounded by 2"'=2'=+ie~5^n . 

(2) Assume that both an and A„ are of order o(^/n). Then there are constants 
c, C < oo depending only on k, 71, . . . ,7^, and the sequence A„, such that 
for n sufficiently large we have 

i?„,fc(A„) < Cn^e^'^"'(i+°(i))(ey"° +e-^"/2). (3.16) 

Here no = (1 + e)/e with e as in H2.1|l . 

(3) Let (T^^\ ...,(t('^) be an arbitrary set of row vectors satisfying (|3.13|) . Then 

9max < 2^=^, (3.17) 
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and hence for Xn ~ o{^yn) and ?i sufficiently large, we have that(T^^\ ...^cr^'^^ 
are linearly independent. 

Proof. Statements (1) and (3) are copied from Lemma 3.8 in jBCMNOS] . while 
statement (2) is a consequence of the bounds (3.65) and (3.67) in [BCMNOS] and 
the fact that 2"^„ = e^""^^"''°'-^'^e'^^^\ a bound which follows immediately from 
(ESJ, ifTTUIl . and the definition lE3 of in- □ 

3.5. Factorization for typical configurations. In view of Lemma 13.41 it will 
be enough to analyze the expectations on the right hand side of (|3.11() for config- 
urations cr^^\ (T^^^ that satisfy (|3.13|) and are linearly independent, provided we 
choose Xn in such a way that q;„ ~ o(A„); A„ = o(\Ai) e^^^l'^ decays faster 
than any power of n. 

Consider therefore a family of linearly independent configurations satisfying the 
condition 1)3. 13|) . The main technical result of this paper is that the following 
approximate factorization statement 



'(S(cr(j)) e [a^(^\ for aU j = 1, . . . , fc) 



is valid whenever n is large enough and c^^', cr^'^) obey the condition 13.13|1 . (For 
the non-Gaussian case, we will also need that = o{n^l^\ while the assumption 
an — o{y/n) stated at the beginning of this subsection is enough in the Gaussian 
case.) 

If Xi, . . . , Xn are standard normals, the bound H3.18|l is quite intuitive and not 
hard to prove. Indeed, let H{a) be the random variable defined in (|3.7|1 . and let 
k(*^^(-) be the joint density of H{cr'-^^), . . . , H{cr^''^). Using the notation x and y 
for vectors with components xi, . . . ,Xk and yi, . . . ,yk, respectively, we then express 
the joint density of i?((T(^)), . . . , E{(t^''^) with respect to the Lebesque measure on 



I'l as 



Xl,...Xk- 

yi=±xi 

If Xi, . . . , Xn are standard Gaussians, the joint distribution of if(cr(^'), . . . , H{<j'^^^ ) 
is Gaussian as well, with mean zero and covariance matrix 

Cy = £:[i7(o-('))i/(cr(^))] ^ g(cr«,cr(^)), (3.20) 

leading to the representation 

K^^^x) = , \„ A^ cxpf--(x,C'ix)V (3.21) 
^ ' (27r)'=/2Vd^ 2^ ' ' 

whenever C is invertible. As usual, C^^ denotes the matrix inverse of C, and 
(x, C^^x) = J2i.j ^i^ij^^j- Observing that C~j^ = 6ij +0((?max), this immediately 
leads to the bound (|3.18|) : see Section 0] for details. 

In the non-Gaussian case, we do not have an explicit formula for the joint density 
of E{cr'^^^), . . . , E{(T^''^), thus making the proof of the approximate factorization 
formula (|3.18)l much more difficult. Basically, the proof requires a multi-dimensional 
local limit theorem for a case in which the arguments of the probability density 
under investigation grow with n. In order to prove this local limit theorem, we will 
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use an integral representation for the probabilities on the left hand side of (|3.18|l 
and then use a saddle point analysis to prove (|3.18|l . In contrast to jBCMNOS] and 
|BCP01j . where it was sufficient to analyze the saddle point of the integral in the 
original domain of integration, the case with growing a„ considered here requires 
a more sophisticated analysis, involving the shift to a complex saddle point in a 
complex space of dimension k; see SectionElfor the details. 

Let us close this section by showing how to complete the proof of Theorem 13.11 
once the bound (|3.18|) is established. To this end, let us first consider the case 
where a„ grows somewhat more slowly than o(n^/^), in particular assume that a„ = 
o{n^^^). Choosing A„ = n.-^^^, we then invoke the bound H3.17|l from Lemma [3.41 
(3) to conclude that Qmax = 0(71^^^"^) and a^g,nax ~ o{l) whenever cr^^), tr^'^' 
satisfy the bound H3.13|l . For a family of configurations satisfying H3.13|l . the the 
multiplicative error term in the joint probability on the right hand side of H3.18|) is 
therefore equal to 1 + o(l), leading to asymptotic factorization of the probabilities 
on the right hand side of (|3.11(l . Using Lemma 13.41 (2) to bound the sum over 
families of configurations not satisfying (|3.13(l . and Lemma|^31(l) to show that the 
number of families of configurations satisfying H3.13|l is equal to 2"'^'(1 + o(l)), this 
gives the bound (|3.1|) from Theorem 13. II 

For still more quickly growing q:„, it is not enough to use the worst case bound 
(j3.17|l . Instead, we would like to use that a typical family of configurations has 
maximal off-diagonal overlap of order n~^/'^. For a typical family of configura- 
tions, we therefore get asymptotic factorization as long as a„ = oin^/^). The next 
lemma, to be proved in Section [3 implies that the error term coming from atypical 
configurations does not destroy the asymptotic factorization. 

Lemma 3.5. Let c > 0, let k be a positive integer, and let a„ be a sequence of 
positive numbers such that anU^^^^ as n ^ oo. If f is a function from the 
set of all families of configurations (T^^-*, . . . , cr^'^) G { — 1,+!}" into R such that 
|/(cr(i),...,cr(^))| < m2g„,ax; then 

e/(-<''---''') = l + o(l) (3.22) 



E 



as n ^ oo. 

We will now show that Lemmas 13.41 and 13.51 combined with H3.18|l and (|3.5|) 
immediately imply Theorem 13.11 Indeed, let us choose A„ in such a way that 
an = o(A„), A„ = o{y^) and e^^"/^ decays faster than any power of n. Invoking 
(|3.11|l and using Lemma l3.4l f2) to bound the sum over configurations which do not 
satisfy (|3.13() and the bounds 13.18|1 and H3.5|l to approximate the remaining terms 
on the right hand side of ()3.11(l . we get the estimate 

m k 

E[n(^«(4,6l)).,]=(l + o(l))(n7.w)2-"'^ E" 

i=l i=l ct(1),...,<t('=) 

(3.23) 

where the sum runs over families of linearly independent configurations (t^^\ 
cr*^*^) that satisfy H3.13|l . Using Lemma [3.41 fl) to extend the sum to a sum 
over all families of configurations cr'^', . . . , cr^'^) e {^1- we then refer to the 

statement of Lemma [3.51 to complete the proof of Theorem 13. II 

The above considerations also indicate why the asymptotic factorization of the 
factorial moments fails if a„ grows faster than o{n^^^). Indeed, expressing the 
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matrix elements of C as Cij = 5ij + qij where qij ~ q{cr'^^\ cr^^^) \ii ^ j and qij ~ 
if i = j, (x, C^^x) = J2j x] - Ei^^j x.iq^jXj + 0(gmaxl|x||i), where, as usual, ||x||2 
denotes the ^2 norm of x. This in turn leads to the more precise estimate 



^\j[{Zn{alK,))u, 



(n 



-nk 



E 



(3.24) 



■E.^, 9.3+0(a^<3Lx)+o(l) +o(l) 



r(i) ,rW 



for the case where Xi, . . . , X„ are standard normals. While the term ^ X^i/j 9y 
is negligible if Q!„ = oin^/^), it becomes important as soon as a„ grows like n^/'^ or 
faster, leading to the failure of asymptotic factorization for the factorial moments. 
The details are straightforward but a little tedious and are given in Section 01 for 
the Gaussian case, and in Section 15.41 for the non-Gaussian case. 



3.6. Integral representation. As discussed in the preceding subsections, the con- 
vergence of the factorial moments of Z„ reduces to the proof of the approximate 
factorization formula (|3.18|l . Following a strategy that was already used in |BCP01| 
and [BC^MNOH] , our proof for the case where Xi , . . . , X„ are not normally dis- 
tributed uses an integral representation for the probabilities on the left hand side 

of Jirra . 

To derive this integral representation we first express the indicator function 
I^^\a) in terms of the function rect(x) defined to be 1 if \x\ < 1/2 and oth- 
erwise. Using Fourier inversion and the fact that the Fourier transform of the 
function rect(a;) is equal to sinc(/) = ^^fj^ this leads to the representation 

/oo 
sinc(/g„,,) cos(27r/i^„V^)e2"/Sr=i (3 35) 

-00 



where tf^ = + denotes the center of the interval [a^,6^], and qn,e = 

lt£.n\/n. Taking the expectation of both sides of (|3.25|l and exchanging the expec- 
tation and the integral on the right hand side (the justification of this exchange is 
given by Lemma 3.2 in IBC'MNOKj ). we get the representation 



»(i?Me[al,6l])=2E[/(^)(^)] 

,00 (3.26) 
= 2qnj / sinc(/g„,£)cos(27r/tfjV")/5"(/)d/ 



where p{f ) = E[e^''*'^"^] is the Fourier transform of p. 

When deriving an integral representation for the terms on the right hand side 
of (|3.11|) . we will have to take the expectation of a product of integrals of the 
form H3.25|l . Neglecting, for the moment, problems associated with the exchange of 
integrations and expectations, this is not difficult. Indeed, rewriting the product of 
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integrals as 

^ poo 



/oo 
. -OO 

(3.27) 



k 



n n sinc(/,<z„,,(,)) cos(27^/,^l(^")^/^)d/•, 

s=l 3 = 1 



where 



Vs = Y.o^^^f,, (3.28) 

i=i 

and taking expectations of both sides, we easily arrive at the representation 



i?(<T(j")) G for all .7 = 1,..., fc) = n(29-^)'^ 

(3-29) 

n n sinc(/jg„,£(j)) cos(27r/jt^(j) V^)d/j. 



But here a little bit more care is needed to justify the exchange of expectation and 
integrals. Indeed, this exchange can only be justified if (t^^\ . . . jCr^^' arc linearly 
independent (see Lemma 3.5 in IBCMNOSj l. but luckily, this will be all we need. 

3.7. The SK Model. As in the proof for the Npp, we define Z„(a,6) to be the 
number of points in the energy spectrum that fall into the interval [a, 6], where the 
energy of a configuration cr is now given by (|1.4|) . Given a family of non-overlapping 
intervals [ci, di], . . . , [cm, dm] with cg > 0, and = q 4- 7£ > C£ for £ = 1, . . . , m, 
we now consider the intervals [a^, 6^] with = a -I- and 6^ = a + di^n, where 
^„ is defined in (|1.6|1 . Theorem 12 . 21 now follows immediately from the following two 
theorems. 

Theorem 3.6. Let «„ = o{n) he a sequence of positive real numbers, let m he a 
positive integer, and for £ ~ 1, . . . ,m, let a'^ and &^ he as above. For an arbitrary 
m-tuple of positive integers (fci, ..^km) we then have 

m m 

Jim E[n(^««, hi))k,] = n 7^- (3.30) 

~* £=1 1=1 

Theorem 3.7. There exists a constant eo > such the following statements hold 
for all c > and 7 > 0, all sequences of positive real numbers an with a„ < e^n, 
and a„, 6„ of the form a„ = a„ -|- c^„, 6„ = a„ -f (c -f 7)f„. 

(i) \imn-,oo'K[Zn{an,bn)] =7. 

(ii) E[(Z„(a„, 6„))^] is bounded uniformly in n. 

(Hi) //limsup^ > 0, then limsup„_o^ E[(Z„(a, 5))2] > 7^. 

By now, the proof strategy for these theorems is straightforward: As before, the 
first moment is given as an integral over the energy density. For the SK model, this 
density is given bv p.5|l . leading to E[Z„(a^, &^)] = 7^(1-1-0(1)) as long as ^„ = o(l). 
To analyze the higher factorial moments, we again use a representation of the form 
(|3.11|l . Neglecting for the moment the issue of bounding the sum over atypical 
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configurations, we will again study the factorization properties of probabilities of 
the form 

P(i?(^(^") ) e , for all J = 1, . . . , fc) . (3.31) 

For the SK model, this is even easier than for the NPP with Gaussian noise, since 
E{a) is now itself a Gaussian random variable, rather than the absolute value of a 
Gaussian. The joint distribution of E{(t'^-^^), . . . , E{(t^''^) therefore has density 

^^'^'(x)=(^^oxp(-i(x,C-x)), (3.32) 

where C is the covariance matrix 

C,j = E[^(crW)^(cr(j))] = n((7(cr«, crW))^. (3.33) 

Expanding C~j^ as C^^^ = -^{Sij + ©(gj^jj^^)), wc then get the following analogue of 
the factorization formula H3.18|l : 



p(i?(<T(-'")) e K^'\bi^^^] for all J = 1, ... , fc) 



(3.34) 



In the exponent of the above expression, note that the additional factor of 
relative to the analogous expression for the Npp is simply a consequence of the 
different normalizations of the energies. The more significant difference is the factor 
of 9max rather than (/max, a consequence of the difference between the covariance 
matrices for the two problems. For typical configurations with overlap gmax = 
0(n~^/^), equation H3.34|l suggests that there will be asymptotic factorization if 
and only if a„ = o{n). That this is indeed the case is established in Sectional 



4. The Npp with Gaussian densities 
In this section we analyze the factorization properties of the probabilities 

P (^E{(T^-^^) e [a'J'\bi^''>] for J = 1, ... , /fc) (4.1) 

for the case where Xi, . . . ,Xn arc standard normals. Throughout this section, we 
will assume that a„ — o{y^). 

In a preliminary step, we show that it is possible to approximate the joint density 
(;('=)(y) of i?((T(-'^'), . . . , E{(T^''^) by its value at yi = an- To this end we combine 
the representations H3.19() and H3.21|l with the fact that is bounded uniformly 
in n if ^max = o(l). For yi £ [an^\ ^n'']? we then have 

g^^\y) = 5('=)(«n)(l + O(an60 + OiCd) = g^''\c,u){l + o(l)), (4.2) 
implying that 

rn 

p(ii;(^W)e [alW,6l(^)] forj = l,...,A:) = {j{{inl,f')9^''\o^n){l+o{l)). 

£=1 

^ (4-3) 

Having established this approximation, we now proceed to prove the factorization 
formula 
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4.1. Proof of the factorization formula H3.18|l . Let us express the right hand 
side of 1)4. 3|l as a sum over vectors x € M'^ with \xi\ = a„, see Recahing the 

representation H3.21|l . we expand (x, C~^x) as (x, C~^x) = ||x||2(l + 0((7max)) = 
/cQ!^ + 0(gmaxa^), where, as before, ||x||2 denotes the ^2-norm of x. For q^ax = o{l), 
we further have det C = 1 + o(l), implying that 



Recalhng the approximation P|^i?((T) e [a„,5„]j = g(Q!„)7£^„(l + o(l)) estabhshed 

in Section 13.21 the bounds H4.3|l and (|4.4|l imply the approximate factorization 
formula (|3.18() . Note that the only conditions needed in this derivation were the 
conditions = o{y/ri) and ^max = o{l). Taking into account Lemma |3. 41 fiii^. we 
therefore have established that H3.18|l holds whenever = o(-y/n) and cr'-^\ tr^'^' 
obey the condition H3.13|l for some A„ of order o(-y/n). 

As shown in Section 13.51 the approximate factorization formula 1)3.18)1 immedi- 
ately leads to Poisson convergence if = o{n^^^). To establish that this conver- 
gence fails for faster growing q;„, requires a little bit more work. This is done in 
the next subsection. 

4.2. Proof of Theorem 13. 3L We start again from ()4.3)l . but this time we expand 
the inverse of C a little further. Explicitly, expressing the matrix elements of C as 
Cij ~ Sij + qij where = q{cr^'^\tT^^^) if i ^ j and tjij = if j = j, we clearly have 



(x, C ^x) = ^ - ^ x,q,jXj + 0{q 



' I 

max I 



(4.5) 

= kal - x,q,jXj + 0{ql,^^al) 

whenever gmax = o(l) and \xi\ = a„ for all i. With the help of ()4.3)l . 1)1.3)1 . ()3.19)) 
and 1)3. 21)1 . this implies that 



P {e{(t(^^) e [al^J'), for J 1, . . . , /c) 



(n 2""' E c'^pI 2 E + 0(9Lx"' ) + o(l) 



= 1 XX,. ..Xk- i^j 

=±a„ 



Let us now choose A„ in such a way that Q!„ = o(A„), A„ = o{^/n) and e^^^/'^ 
decays faster than any power of n. Combining ()4.6)l with the representation ()3.11)) 
and Lemma 13.41 (2). we then have 

m k 

E[n(^«(«f.6i)),,] = (n7.w)2-"' E" 2- 



-k 



1=1 i=l 



X (4.7) 
E exp(^- + O(gLxan) + o(l)j + o(l) 



where the sum runs over families of linearly independent configurations (t^^\ 
. . . , (T^^) that satisfy 1)3.13)1 . Next we claim that we can extend this sum to a sum 
over all families of configurations (t^^\ . . . , cr^*"'' at the cost of an additional addi- 
tive error o(l). Indeed, by Lemma 13.41 fl) and (3), the number of configurations 
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(t(^\ . . . , cr^''^ that are linearly dependent or violate the bound is bounded 

by a constant times 2"'^e~^"/^. Since the terms in the above sum are all bounded 
by e*^^""-* = e°('^"\ the extension from the sum to a sum over all families of 
configurations cr^^\ . . . , tr^'^) indeed only introduces an additive error o(l). Ob- 
serving finally that the exponent '^i^j XiQijXj is invariant under the transformation 
Xi \xi\ — an and cr^*) sign{xi)(T^^\ we obtain the approximation H^{.24(l . which 
we restate here as 

m 

E[Y[{Zn{ai,bi))k,' 

e=i 

= (ri^^«)2-"' E cxp(^5]g,,+0(qLx"^)+o(l)) +o(l). 

i=l <t(i)....,ct('') i^j 

(4.8) 

With the approximation (|4.8|l in hand, the second statement of Theorem 13 . 31 now 
follows immediately from the following lemma. 

Lemma 4.1. Let C < oo, let /?„ = o(n), let denote expectation with respect to 
the uniform measure on all families of configurations (t^^\ . . . , (t'^^\ and let R de- 
note a function on families of configurations such that \R{cr^^\ . . . , cr^'^^)\ < Cq^^^. 
Then 



Efc 



exp 



))]=exp( 



k{k-l) 



Pi + 0{p-in-')+o{l) 



(4.9) 



Proof. The complete proof of the lemma will be given in Section [7. 21 Here we only 
show that the leading term behaves as claimed, namely 

Efc [exp(/3„ 9^)] = cxp( ^^^~^^ /3g + 0{pln-^)) . (4.10) 
To this end, we observe that 



- n k 



As a consequence, we have 



s=l i=l 



k. 



(4.11) 



E, 



e-''"E.[nexp(^(^.«)^) 



E, 



ff3n 

exp I — 

V n 



(4.12) 



i=i 



2\ 1 ri 



where Efc denotes expectation with respect to the uniform measure on all configu- 
rations S = ((5i, . . . , Sk) S {—1, Next we expand the expectation on the right 
hand side into a power series in /3„/n (recall that we assumed /3„ = o(n)). Using 
that the expectation of equal to fc, while the expectation of (^,- 5iY is 
equal to ?>k^ — 2k, this gives 
k _ 



exp 



f3„ 



E^' 



expf ^k + ^k{k - 1) + OiPln-^) 



(4.13) 
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and hence 



exp 



k{k - 1) 



(4.14) 
□ 



Remark 4.2. For the special case where a„ grows like n^^'^, Theoreni l3 . 3l imphes that 
the density of the process Nn{t) converges to one, while the process itself does not 
converge to Poisson. Another interesting consequence of our proof is the following: 
Consider the rescaled overlap Qn,t of two typical configurations contributing to 
Nn{t) (see Section O for the precise definition of Qn,t)- Then Qn.t converges in 
distribution to the superposition of two Gaussians with mean ±k^, where k = 



lini,. 



-1/4 



lim P(Q„,t > y) 



+ e 



-dx. 



(4.15) 



This follows again from (|4.6|) : in fact, now we only need this formula for k = 2, 
where the sum over x just gives a factor cosh{a^i^q{cr , cr)). This factor is responsible 
for the shift of in the limiting distribution of Qn.t = n^^^q{cr, cr). 



5. The Npp with general distribution 

In this section we prove the first moment estimate (|3.5ll and the factorization 
formula (|3.18|) for arbitrary distributions p obeying the assumptions of Theorem l2.1l 
see Propositions IS . 2l and l5 . BI below. As discussed in Section^ this immediately gives 
Theorem 13.11 We also show in Section 15.41 how the proof of non-convergence can 
be generalized from the Gaussian case to the general distributions considered in 
Theorem O 

5.1. Properties of the Fourier transform. Throughout this section, we will 
use several properties of the Fourier transform in p which we summarize in this 
subsection. 

(i) For any n > Uq, where uq is the solution of + -j^- = 1 with e as in H2.1|l . 
we have 

/oo i>OD 
\p{fr< / i/5(/)r = co<oo. (5.1) 

(ii) There exists po > such that p{f) is analytic in the region |3m/| < pQ. 

(iii) For any pi > there exists ci > and p2 > and such that 

l/5(/)l<e-^^ (5.2) 

whenever |SJm/| < p2 and |3?e/| > pi. 

These properties easily follow from our assumptions on the density p. The bound 
(|5.1|) is a direct consequence of H2.1|) . Analyticity of p in a neighborhood of the 
origin implies existence of exponential moments and this in turn implies analyticity 
in a strip about the real line. To see that the bound H5.2|) is true, we first observe 
that it obviously holds when / is real. From the fact that dp{z)/dz is bounded 
uniformly in z as long as the imaginary part of z is small enough, we can choose 
P2 in such a way that the bound H5.2(l extends to |9m/| < p2 (with ci replaced by 
a slightly smaller constant, which we again call ci). 
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5.2. First Moment. In this section we establish the representation l|3.9|l for the 
density gn{-), see Proposition l5. 21 below. As explained in Section POl this represen- 
tation immediately give the first moment estimate (|3.5|l for a„ ~ o{y/n). 
We will start from the integral representation 




This formula can easily be derived by first expressing the density of H{(t) in terms 
of the n-fold convolution of p with itself, using Fourier inversion to express this 
density as an integral over p" . and then summing over the two possible choices for 
the sign of H{a) given E{(t) (alternatively, the formula can be derived from (|3.26() 
by sending 7^ to 0). 

In order to determine the asymptotic behavior of gn{a), one might want to 
expand p"{f) about its maximum, i.e. about / = 0. While this strategy works well 
for bounded a„, it needs to be modified in the case of growing a„. Here we will 
use the method of steepest descent, a method which was first introduced into the 
analysis of density functions by a paper of Daniels |Dan54j . though of course the 
ideas go back to Laplace. 

Before explaining this further, let us first note that, asymptotically, the integral 
in (|5.3|l can be restricted to a bounded interval about zero. Indeed, let fii > he 
an arbitrary constant, let p,2 be as in (|5.2|l . and let hq be as in (|5.1|l . For ii > hq 
and I/I > pi, we then have |p"(/)| < |/5"''(/)|e~'^i'^"~"°^ As a consequence, the 
contribution to the integral H5.3|l from the region |/| > fii is bounded by a constant 
times Y^e"'^^"', giving the estimate 

5„(«) = 2V^ r cos(2^/a^/H)/3"(/)4f + 0(^/^6-^^"). (5.4) 
-'-Ml 

Next we observe that both the range of integration and the function / i-^- p(f) are 
invariant under the change f ^ —f, implying that we can rewrite the integral as 

2V^ r e2"*/""^/5"(/)d/ = r e""^^(^^~'"^^^d/ (5.5) 
where F{f) is defined by 

p(/) = e-^(/). (5.6) 
As defined earlier, let fxo be such that p(f) is analytic in the region |3m/| < fiQ. 
Further let < 77 < /io, and let C be the path in the complex plane obtained 
by concatenating the three line segments that join the point — /.Ji to the point 
—pi + ir], the point —pi + irj to the point pi + i-q, and the point pi + ir] to the 
point pi, respectively. Let Q be the region bounded by C and the line segment 
from —pi to pi. Then the function / 1-^ p(/) and hence the integrand in H5.5|) is 
analytic in Q, implying that the integral on the right hand side of 15. 5|) is equal to 
the integral over C. Our next lemma states that the contribution of the first and 
third line segment to this integral is negligible, effectively allowing us to "shift the 
path of integration" into a parallel line segment in the complex plane. 

Lemma 5.1. Given pi > 0, there are constants ci > and P2 > such that 

gM = r'^'" e^"(^(^)"'"^^)d/ + OiV^er^n (5.7) 
provided q;„ > and < r/ < p2- 
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Proof. Taking into account H5.4(l . we need to analyze the integral on the right of 
(|5.5(l which in turn is equal to the integral over the path C defined above, provided 
we choose /i2 < Mo- 
Consider the integral 

/'"'"^'"'e-"(^(-^'-^-^^^)d/. (5.8) 

For / in the line segment joining —fii to — /ii + ij], the imaginary part of / is 
non- negative, implying that \e^'^"^^'^\ < 1. Due to property 1)5. 2|1 of the Fourier 
transform p, we furthermore have that |/5(/)| = je"^*^-^-*! < e~'^^ on this line segment. 
As a consequence, the above integral is bounded by rje^'^^" = 0(e~'^i"). In a similar 
way, the integral over the third line segment is bounded by 0(e~'^^"'). Taking into 
account the multiplicative factor of in H5.4|l and the fact that the error term in 
(|0|) is bounded by 0{y/ne^''''"), this proves (|57|) . □ 



Note that the above lemma holds for arbitrary 77 > 0. As usual when applying 
the method of steepest descent, the value of 77 will be chosen in such a way that 
the asymptotic analysis of the integral on the right becomes as simple as possible. 
Here this amounts to requiring that at f ~ irj, the first derivative of the integrand 
is zero. In other words, we will choose rj as the solution of the equation 

F'iiT]) = 2Tn^, (5.9) 

where F'{z) — ^^^^ denotes the complex derivative of F. This leads to the follow- 
ing proposition. 

Proposition 5.2. There is an even function G{x) which is real analytic in a neigh- 
borhood of zero such that 

ff"K) = y|e-"^(""""''')(l + o(l)) (5.10) 
whenever a„ = o{^/ri). In a neighborhood of zero, G{x) can be expanded as 

G{x) = ^ + 0{x^). (5.11) 
For a„ = o(-\/n) and a^, 6fj and as in Theorem \S.l\ we therefore have 

p(i?(<T) e [aiX]) = 2-("-i)7,(l + o(l)). (5.12) 

Proof. We first argue that the equation 1)5.9(1 has a unique solution which can be 
expressed as an analytic function of a; = cxnl \pn. Consider thus the equation 

F\iT]) = 2-Kix. (5.13) 

Since p is an even function which is analytic in a neighborhood of zero with p[fS) = 1, 
the function F is even and analytic in a neighborhood of zero as well. Taking into 
account that the second moment of p is one, we now expand i^(/) as 

F{f)^\{2^ff^O{,f^) (5.14) 

and F'{f) as 

i^'(/) = (2^)V + 0(/'). (5.15) 
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By the implicit function theorem, the equation H5.13|l has a unique solution 'i]{x) in 
a neighborhood of zero, and ri{x) is an analytic function of x. Taking into account 
that F' is odd, wc furthermore have that ri(x) is an odd function which is real 
analytic in a neighborhood of zero, and expanding rj{x) as i]{x) = j^x + 0{x^) we 
see that 77(2;) > if x > is sufficiently small. 

Having established the existence and uniqueness of ri(x) for small enough x, we 
are now ready to analyze the integral in 1)5. 7|l for rj — r]{x) and x = an/\/n = o(l). 
To this end, we rewrite the integral as 

n{Fiir,{x) + f)+27,x{7jix)-if)) ,r -nG{x) ~nF{S) 



^-nyr{iri[X) + j)+z7TX[7j{X)-ij)j ^ ^-nu(x) ( e~'^^' ' df (5 16) 

-Ml •'-Ml 

where 

G{x) = FiiTj{x)) + 2nxr]{x), (5.17) 

and 

F{f) = F{iri{x) + f)~ Fiirj{x)) - 2mxf. (5.18) 
Next we would like to show that the integral on the right hand side of (|5.16|) can 
be restricted to |/| < logn/^/n. To this end, we expand F{f) about / = 0. Taking 
into account the fact that the first derivative at / = vanishes by the definition of 
77(0;), we get 

F{f) = If^''> {iri{x))f + {iri{x))f + 0{f). (5.19) 

Using again that F is an even function of its argument, we conclude that F^'^\iri{x)) 
is real, while F^'^\irj{x)) is purely imaginary, so that 

5Re F{!) = 1^(2) {ii^{x))f + 0{f). (5.20) 

Since 77(0;) = 0{x) = o(l) and F^'^\iri{x)) = in^ + 0{ri{x)'^), we have that 
5Re F{f) > p provided /ii is sufficiently small and n is sufficiently large. As a 
consequence, we get that 

/Ml ~ /•n^'^^^logn -1 , 

e-nFif)df^ e-^nf)df + o(^e-'°^"A. (5.21) 

For I/I < log7i/-y/n, we now expand 

-"^(/) = cxp(-|F(2)(^7^(x))/2 + Oinf)) 



e 



= exp 

leading to the approximation 



-^F(2)(*7y(x))/2)(l + 0(n/3)), 



(5.22) 



""'''^/ = \ L.JL..A I + Oin-'/')) - + oil)). 



e 



n-i/2iogn y nF(2)(ir;(x)) V27m 

(5.23) 

Combined with H5.7|l . (|5.16|) and (|5.21|) this proves (|5.1U|I . 

Next, we show that G{x) is an even function which is real analytic in a neigh- 
borhood of zero and obeys the bound H5.11|l . But this is almost obvious by now. 
Indeed, combining the fact that F is an even function which is real analytic in 
a neighborhood of zero with the fact that r]{x) is an odd function which is real 
analytic in a neighborhood of zero, we see that G{x) is an even function which is 
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real analytic in a neighborhood of zero. With the help of l|5.14() . the bound (|5.11() 
finally follows by inserting the expansion 7j{x) = j^x + 0{x^) into the definition 

HFTWi of G. 

As we already argued in Section lX^ the bound (|5.12() finally follows immediately 
from the remaining statements of the proposition. □ 



5.3. Higher Moments. In order to establish convergence of the higher factorial 
moments, we start from the integral representation (|3.29|) . Recalling the definition 
(Eiai, let 

nmi„ = n„,in(<T(i',...,<T(")) =min{n5: (5.24) 



In a first step, we show that under the condition that rimin > uq where uq is the 
solution of + ^ = 1 with e as in (|2.1|l . the integral in (|3.29(l is well approximated 
by an integral over a bounded domain, with the product of the sine factors replaced 
by one and the various midpoints tf^ replaced by a„. 

Lemma 5.3. Given /ii > there exists a constant ci > such that for an ~ o{\/n) 
nmin{c'^^\ •■j"'^*''') ^ '^0 have 



E{cT^=^) € K«, /or J = 1, ... , fc) = \{{2qn,,) 



e=i 

k 

[]p(f • J] cos(2^/,«„V^)d/, + 0(e-^^"— ) + OiV^Cn 

'"''1 S j=l 

(5.25) 

Here the product over S runs over all 6 G { — 1, +1}*^ and f • 6 stands for the scalar 
product J2j fj^j- 

Proof. Consider the integral on the right hand side of H3.29|l . In a first step, we will 
consider the contribution of the region where \fj \ > for at least one j and show 
that it is negligible. To this end, it is clearly enough to bound the contribution of the 
region where l/jl > Mi- But if J2'j=i \fj\ > Mii then there is a ^ G {— 1,+1} 

such that I X]j=i ^jfj\ > Ml- Since > 1 for all S e { — 1, +1}'^, we conclude that 
there exists an s G {1, . . . , n} such that \vs\ > Hi- 

Thus consider the event that one of the Iwsl's, say |wtj, is larger than fj,i. Let 
^1 = Wtl\ --j^tf''}j ^'^d let ^2, ■■■,^k be vectors such that the rank of the matrix 
A formed by ..,dk is fc. Let {ftj, ■■■,vtk} be defined by 



Y^Slf,. (5.26) 



Since A has rank fc, we can change the variables of integration from fj to Vt^. Let 
the Jacobian of this transformation be J^, i.e., let Jk = |det A|^^ where det A is 
the determinant of A. Since A has entries ±1 and is non-singular, we conclude that 
I det A I > 1, implying that \Jk\ < 1. We therefore may bound the integral over the 
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region where where |wtj > by 

n k 
oO"'|yti|>Ml ,,= 1 j = l 

< [[ri[\p{vt,)ndvt^x f \piv,,)r^dv,, (5.27) 

< (Co)'^-i / \p{yt,T'^dvt, < Co^e-^i 

"'|i'ti |>'^i 

where Co and ci are as in H5.1|l and H5.2|l . Since the number of choices for St^ is 
bounded by 2*^, the bound H5.27|l imphes that the contribution of the region where 
at least one of the |/j|'s is larger than pi is bounded by 0(e~'^i"'"'"), with the 
constant implicit in the O-symbol depending on k. 

Noting that <lnj(j) = 0(^/n^„), we finally observe that in the region where 
l/il ^ Ml for £''11 J j we may expand shic{qnj{j)fj) as l + 0{^/ri£_n) and cos{2tt f jtn^/n) 
as cos(27r/ja„-yn) + 0{s/n^n)- Rewriting the product Hs Pi'^s) as 

n 

X{p{vs)^ n P(f-^r, (5-28) 

s=l 56{-l, + l}'= 

this gives (|5.25|l . □ 
Next we rewrite the integral in (|5.25|) as the average 



J] p(f • ^)"^ J] cos(27r/,a„ V^)d/, 
Ml 5 j=i 



Ml 



(5.29) 



=2"' E /// e™-n^(f •'^)"'n^/^ 

xef-s^,^}" "'"'"'-Ml 5 j=i 

where the sum goes over all sequences x = (.ti, . . . ,Xfc) € {"'^i '^J''' ^ ' ^ 
stands again for the scalar product, f -x = fjXj . In order to analyze the integrals 
on the right hand side, we will again use the method of steepest decent. To this 
end, we first prove the following analog of Lemma l5.ll As before, /^o is a constant 
such that p{J) is analytic in the strip |3m/| < /io- 

Lemma 5.4. Given pi > Q there are constants ci > and p2 G (0, /io) such that 
the following bound holds whenever > and 771, . . . ^rjk is sequence of 

real numbers with ^ \r/j\ < p2 (^nd rjjXj > for all j : 



^Ml X 1=1 

e^^^i'^-^-^-^) Y[p{{.5 + iT]- f[ dfj + 0(e-^'=i""""). 

-Ml S 3 = 1 

Proof. For j — 1, . . . let Cj be the path consisting of the three line segments 
which join the point —pi to the point —pi + iifj^ the point —pi + irjj to the point 
pi + irjj, and the point pi + irjj to the point pi, respectively. For j = 1, . . . , fc, we 
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replace, one by one, the integrals over the variables fj by integrals over the paths Cj 
and then bound the contribution over the part coming from the two line segments 
joining —fii to —fii + irjj and fii + irjj to fii, respectively. In each of step, we then 
have to bound integrals of the form 

k 

Y[p{f ■ S)""' Y[ e2™^^'//d/jv (5.31) 

S j' = l 

where /i, . . . , /j-i run over the line segments from — /.ti + irjj to /ii + irjj, fj runs 
either over the the line segment from —^i to —pi + irjj or the line segment from 
/xi + irjj to /ii, and /j+i, ■ ■ ■ , fk run over the interval [— /ii, /ii]. To bound these 
integrals, we note that in this domain of integration the real part of Jj has absolute 
value /ii, implying in particular that 

Mi<^|5ie/.|<fcAii. (5.32) 

i 

As a consequence, |5Ref • 5\ < kfii for all 6, and further there exists at least one S 
for which jSRef ■ 5\ > /ii. (In fact it is easy to see that there are at least 2'^^^ such 
5's.) On the other hand, the assumption |?7j| < fi2 implies that |3mf ■ d\ < fi2 
for all 5. 

Consider first a vector S for which /ii < |3fief • d\ < fc/ii, and assume that 
1^2 ^ A'o is chosen in such a way that (|5.2|l holds. Under this assumption, we have 
|p(f • ^)| ^ e~'^% implying that the term p{i ■ 6)"^ contributes a factor that is at 
most e~"'"'°'^i. 

To bound |jo(z)| when we only know that jSRezj < fc^i, we use continuity in 
conjunction with the fact that \p{z)\ < 1 for all real z. Decreasing /i2, if necessary, 
we conclude that \p{z)\ < e^^^ whenever \3iez\ < kpi and |Qmz| < /X2. As a 
consequence, the first product in (|5.31|) can be bounded by 

s 

Since the second product is bounded by one as before, and the integral only con- 
tributes a constant, this proves the lemma. □ 

Next we have to determine the values of the shifts 771 , . . . , 77^ . According to the 
method of steepest decent, we again choose a saddle point of the integrand. Since 
the integrand is now a function of k variables, this now gives a system of k equations 
for T] = (771, . . . ,77fc): 

HSjF'iiS ■ r]) = 2TriXj. (5.34) 
s ^ 

Assume for a moment that this equation has a unique solution r/ — J7(x). We then 
define a function 

G„.fc(x) = y ^F{iS ■ J7(x)) + 27rJ7(x) • x. (5.35) 

^ — ' 77 

s 

Lemma 5.5. Let < a„ = o{y/n), let gmax = o(l), and let x G R'^ be such that 
\xi \ = OLnl \/n. Then the equation (|5.34(l has a unique solution rj = I7(x), 
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an. 



d rjj[x)xj > for all j e {1, . . . , k}. In addition, for sufficiently small fii, 



' g2^„(«x.f-x.,7) Y[p{f-S + irj- S)"' f[ dfj 
'''' ' ^=1 (5.37) 



e 



h /2 



Proof. For x = 0, the equation H5.34|l is obviously solved by — 0. To obtain 
existence and uniqueness of a solution in the neighborhood of zero, we consider the 
derivative matrix of the function on the left hand side, 



A:, (i7) =iJ2^6.,S,F"iiS ■ 77). (5.38) 

Using the fact that F" {f) ~ (27r)^ + 0{f'^) we may expand Aij{T]) as 

A,{V)^A,{0) + 0{MI), (5.39) 

and for rj = we have 

n 



A,j(0) = {2TTyiJ2—SrSj - i^nfiQj, (5.40) 



where Cij is the overlap matrix Cij = q{(T^'^\cr^^^). If the maximal off-diagonal 
overlap ^max is o(l), the matrix C is invertible, implying in particular that Aij{r]) 
is non-singular in a neighborhood of zero. By the implicit function theorem, we 
conclude that for x sufficiently small, the equation (|5.34|) had a unique solution 
?7(x), and by (|5.39() . wc may expand J7(x) as 



2tt 



For Qinax = o(l), the matrix C ^ can be approximated as C^^-^ — Sij + o((7„iax), 
implying in particular that for \xj\ = an/ \/ri, the leading term on the right hand 
side is of order an/\/n. As a consequence, we can convert the additive error into a 
multiplicative error l + O^a^/n), giving the desired bound H5.36|l . Using once more 
that C~j = 6ij + o(gniax), the fact that Xjrjj{x) > is an immediate corollary of 



We are left with the proof of H5.37|) . To this end, we rewrite the left hand side 

as 



Ml ^ 



27rri(ix f-X Tj)g- s nsF{f S+ir] S) 



g -nG,(x) III''' g-„G„ ,(f) 



(5.42) 

Edf, 



Ml j=i 

with 

G„,fc(f) = ^ — (F(f S + irj -S)- F{ir] ■ S)) - 27rix • f. (5.43) 
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Observing that the derivatives of Gn,k{f) at f = vanish by the definition of t7(x), 
we now expand G,i_fc(f) as 

G„,fc(f) = ^ ^ (i(f . ^rF"i^rJ ■S) + ^ii- dfF^'Hzri ■ d) + 0((f • d)'')) . (5.44) 



n \2' ' ' ' ' 3! 



Arguing as in the proof of Proposition 15 . 21 we now have that 

5ieG„,fe(f) > E ■ '^)' = E /^C-,,/, (5.45) 



provided [i\ is sufficiently small and n is sufficiently large. Since the maximal 
off-diagonal overlap Qmax is of order o(f), we conclude that 3fieG'„_fc(f) > ^|jf||i 
provided /ii is sufficiently small and n is sufficiently large. As a consequence, the 
integral in H5.42|l is dominated by configurations for which ||f II2 is smaller than, say, 
logn/y^. More quantitatively, we have 



Ml ^ 



-Ml j=i 

f ||2<log n/v/n ^_-|^ ^ ^||f||2>logn/v^ j—\ 

k 

f||2<logri/Vn j^i 



(5.46) 



(5.47) 



For ||f||2 < \ogn/y/n we expand e fc(^) as 

G„.,(f) = exp(-l E TT^^ ■ ^)'^"(*^ • ^) + 0{n\m) 
s 

= e-5S..^'*^-/^(f + 0(n||f||3)) 
where Af is the matrix with matrix elements 

Mr, = E —5.5jF"{z'n ■ 8) = (27r)2G,, + 0(||r,f ) 

5 " (5.48) 
= {2^f5,, + 0{al/n) + 0(w)- 

As a consequence, 

//t.s,..v.."""''^n* ^ (i)'('+°<""'^'+''<"-/">+«(-">)- 

(5.49) 

Combined with H5.42|l and (|5.46() . this imphes the desired bound (|5.37(l . □ 



Lemma I5. 31 Lemma l5.4l the relation (|5.29|) . Lemma 15.41 and Lemma 15.51 we now 
easily prove the following proposition. 
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Proposition 5.6. Let < = o{y/n) and let < A„ = o{^/n). Ifn is sufficiently 
large and cr'^-' . . . jCrC''' obey the condition (|3.13() . then 



IIL 

E{a^^y) e [ai^^\bi^^^] forj = l,...,k) = (j[ 7^) 



6, 



/27r 



^ e-"«" '=(^) (1 + 0(1)). 



(5.50) 



// we strengthen the condition an ~ o(v^) ^'^ = o(n^/'*), we have 
p(i?(<T(^)) G fel^-'")] for allj^l,..., k) 



(5.51) 



4=1 



Proof. Observe that under the condition H3.13|l . we have that ri,,„in = n2~'^ + o(n) 
and (?max = 0(1) • We may therefore use Lemma 15.31 Lemma 15.41 the relation 
(|5.29|l , Lemma 15.41 and Lemma 15.51 to conclude that under the conditions of the 
proposition, there is a constant c > such that 



E{cT^^^) e [afW), for J = 1, . . . , fcj = H^^*.,^)'^ 

f=i 

E (e-"^"-(^)(^)'^'(l + o(l)) +0(e-") + 0(V^6.))• 



x 2^'' 



(5.52) 

Next wc expand G„^fc(x) with the help of the bounds H5.14|l and H5.36(l . yielding 
the approximation 



G„,fc(x) = ~IY. ^^^""^ ■ + • X + 0(||x| 
s 



= -^(^,C77) + 27rT,.x + 0(||xll|) (5-53) 

= i(x,C-x)+0(^ 

Note that this imphes in particular that ?iG„,fe(x) = O(a^) + 0{an/n) = o{n). 
The leading term on the right hand side of (|5.52() is therefore much larger than 
the additive error terms, which both decay exponentially in n. These additive 
error terms can therefore be converted into a multiplicative error term (1 + o(l)). 
Inserting the value of qn,i{= li^nV^) and using the fact that J^e = ^; this yields 
the approximation (|5.50|) . 

To infer the bound 15.51|l . we use the assumption a„ = o(n^/^) to expand the 
factor {£,n/V2^f as 2-"'''e*-'""/2(l + o(l)), and the factor e-"<^">'=W as 



e 2 



(x,C-ix)+0(ai/„) ^ g-t||x||^+0(n||.||^w) (l + ©(a^ /n)) 

(1+0(1)' 



(5.54) 
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Inserting these estimates into H5.50|l and taking into account the bound H5.12(l , this 
gives H5.51|l . □ 

5.4. Failure of Poisson convergence for faster growing ocn- In this section, 
we prove the last statement of Theorem 12. II for general distributions. Throughout 
this section, we will assume that a„ = 0{n^^'^) and A„ = o{\/n). Near the end of 
the section, we will specialize to a„ = for which we will prove absence of 

Poisson convergence. 

We start from (|5.50() , but instead of H5.53|l we use a more accurate approximation 
for G„,fc(x). To this end, we use the fact that p{f) ~ p{~f) is analytic in a 
neighborhood of zero to infer the existence of a constant C4 such that 

F(z) = ^ ^^(2^)4^4 + o{\zf) 

and 

F'{z) = {2^fz + 4c4(2^)^z^ + 0(|z|^). 

Remark 5.7. Using the fact that ¥.{X^) > E{X'^)'^ = 1 and expanding the log p(f) 
one can see that C4 < j^, with equality holding if and only iiX^ = 1 with probability 
one. Thus for all random variables whose density satisfies assumption (|2.1|l we have 
C4 < 1/12. 

Using these expressions and the fact that C~j^ = (^ij+0(gmax) = 5y +0(A„n~-^/^) 
when <t(^) . . . , cr^'^) obey the condition H3.13|l . we then expand the solution of (|5.34|) 
as 



^(C-^x), + PY: -iC-'SW, C-^xf + 0(||x||^) 
s 

i(C-ix), + pY. -^^(^ ■ + 0(l|x|n + 0(||x|pA„n-V2). 

'.TT /it ^ — ^ ri 



2tt 2tt ^ n 

s 



In order to analyze the sum over S, we use the condition (|3.13() to estimate 

5 6 

The first term is zero unless i,j,k,l are such that either all of them are equal or 
are pairwise equal for some pairing of k, I. Observing that there are 3 possible 
ways to pair four numbers into two pairs of two, this leads to the estimate 

m = ^(C-'^h + ^x,{x^ + 3E^')) + Odlxf A„n-i/2) + 0(||xf ) 

(5.56) 

= ^(C-ix), + (1 + 3{k l))^^;, + 0(||x||-^A„n-i/2) + 0(||x|l5) 

where we used the fact that \xi\ = \xj\ = anl\/n in the last step. Inserting this 
expression into the definition (j5.35|l and expanding the result in a similar way as 
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we expanded 77 above, this leads to the approximation 

s s 

+ Oi\\xf) + Oi\\x\\^Xnn-'/') 

= -^^(»7, Cv) + C4fc(l + 3{k - 1))^ + 27r,7 • X 
+ 0(||xf) + 0(||xrA„n-i/2) 

i(x,C-ix)-fc(l+3(fc-l))^ , 
+ (x, C-ix) + fc(l + 3(fc - 1))^^ + 0(||xf ) 



(5.57) 



= -^(x, C-^x) - fc(l + 3(fc - 1))^^ + C4fc(l + 3(fc - 1))^ 

(x,C-ix) + fc(l 
0(||a-rA„n-i/2) 

:, C^^x) +04/0(1 - 
Using that = 0(n^/'*) and A„ = o(-\/n), this gives 



i(x, C-^x) + C4fc(l + 3(fc - 1))% + O(^) + 0(^A„n-i/2). 



n , ^ a. 



,4 



nG„.fc(x) = -(x, C-^x) + C4fc(l + 3(A: - 1))^ + o(l). 
2 n 

As a consequence, we have that 

m 

p(i?(<T(^-)) e for J = 1, ... , fc) = (n 7^ 



^ xfc 



(5.58) 



whenever cr^^) . . . , crC^' obey the condition H3.13|l . 

Next we expand using that ^„ = (2'-"~"'^-'(7„(q;„))~^ with (7„(q!„) given by 
H5.1U|I . To this end, we approximate r\{x) as t]{x) = ^(1 + 4c4X^) + O(x^) and 
0(0;) as 



2 



G(x) = -i(27r77(x))2 + c^{2^:^{x)f + 27r.T?7(a;) + 0(xS) = y + C42:'^ + 0(a;'^), 
giving the expansion 

in = 2-^"-'^ exp(ia^ + C4^) (l + o(l)) . (5.59) 

Inserting this expansion into H5.58fl we then continue as in the proof of Theorem l3.3l 
to get that 

E[n(^n(4, bi)),,] ^ (n7,'") cxp(^(^^^ ~ 3c4fc(fc - 1))) (1 + 0(1)) . 

e=i 1=1 

(5.60) 

Specializing now to a„ — kv}/^ ^ this implies in particular that all moments of Zn(t) 
are bounded, and 



lim 



E[(Z„(t))fc] =t'=exp(K''(/s-l)fc(^l/4-3c4)). (5.61) 
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In view of Lemma [3.21 and the fact that C4 < 1/12 by R.emark |5.7I this is incom- 
patible with weak convergence to a Poisson random variable. This establishes the 
failure of the modified REM conjecture for lim„^oo ctn^^^ = k > 0. 

Note that the original REM conjecture also fails if lim„^oo an?^"^^^ = k > 0. 
Indeed, in this range, the original scaling (|1.3(l and the modified scaling differ by 
the asymptotically constant factor e"^^** (l-t-o(l)). This makes things worse, since 
now even the first moment does not converge to the desired value unless C4 = 0, 
in which case the original and the modified REM conjecture remain equivalent for 
Cfn = 0{n^^^). This completes the proof of the last statement of Theorem 12. II 

Remark 5.8. It is not hard to see that for a„ = ktt,^/^ the distribution of the overlap 
Qn,t converges again to a superposition of two shifted Gaussians, see Remark 14.21 
where this was discussed for the case where Xi, . . . ,X„ where standard normals. 
Indeed, the only difference with the situation discussed in that remark is the extra 
factor of e~'=<'('^~i)(i+3'^)'* . But this factor docs not depend on the overlap, and 
hence does not influence the overlap distribution. 



6. Analysis of the SK model 

For a„ = 0{n^) and rj <1, the local REM conjecture for the SK model has been 
proved in |BK05aj . To extend the proof up to the threshold an = o(l), a little bit 
more care is needed, but given the analysis of the Npp with Gaussian noise from 
the Sectional this is still relatively straightforward, even though the details vary 
at several places. But for of order n wc have to be quite careful, since now 
several error terms which went to zero before are not vanishing anymore. It turns 
out, however, that for a„/n < eg and eg sufficiently small, we can at least control 
the first three moments, which is enough to prove absence of Poisson convergence 
if limsupa„/7i > 0. 

We start with the analysis of the first moment. 

6.1. First moment. For the SK model, the energy E{cr) is a Gaussian random 
variable with density given by H1.5|l . For an = 0{n) and = o(l), the first moment 
can therefore be approximated as 



nZn{al,bl)]^ 




(6.1) 



'^27771 
7,(1 + 0(1)) 



This proves the convergence of the first moment for a„ < cn and c < log 2. 

6.2. Families of linearly independent configurations. When analyzing the 
factorization properties of the joint distribution of i?(<T(^^), . . . , E{ct'^^^)^ we will 
want to use the representation H3.32|l . which at a minimum, requires that the co- 
variance matrix C defined in H3.33() is invertible. The following Lemma l6. II shows 
that a family of linearly independent configurations leads to an invertible covari- 
ance matrix, and gives a bound on the contribution of the families which are not 
linearly independent. It serves the same purpose as Lemma|^3(2) served for the 
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Npp. Note that statements similar to those in Lemma Ffi . 1 1 were proved in |BK05a| 
under the more restrictive condition that an = 0{n^^') with 77 < 1. 
To state the lemma, we define Rn.k as 

Rn,k = ^ E' ]p(^^('t(^'))G foraUj = l,...,A:), (6.2) 

where the sum runs over pairwise distinct configurations (t^^\ . . . ,cr'''"^ which are 
linearly dependent. 

Lemma 6.1. Let k < 00. Then there exists a constant > such that the 
following statements are true. 

(1) If (T^^\ . . . ^cr^'^^ are linearly independent, the matrix C defined in (|3.33(l is 
invertible, and 

5^'=nx)<(^)''%-(---^-). (6.3) 

(2) If an < nek, then 

Rn.k ^0 as n ^ 00. (6.4) 

Proof. (1) This statement is quite easy and must be known to most experts in the 
field. First, we rewrite the matrix elements of C as Cij = nq{'q'^'^\ri^^^), where 
r]'--^\ ry^'^^ are vectors in {—1,+!}"". Indeed, setting rjl^l = ai^''ar \ where 
r,s = 1, . . .n and i = 1, . . .A;, we see that q{ri^^\'r]^^'^) = ((7(<t(*\ cr^-'))) , implying 
the above representation for C . Next we observe that that the linear independence 
of cr(^\ . . . , cr^*^) implies linear independence of Jy'-^-', . . . , 'ri^''\ which in turn can 
easily be seen to imply linear independence of the row vectors of the matrix with 
matrix elements q{T]'-'^\ r)^-''') ■ This gives that C is non-degenerate. But C is a 
k X k matrix with entries which are multiples of n~^ , so if det C 7^ 0, we must have 
I detC| > n"*^. This imphes the bound (jOl- 

(2) To prove (2), we decompose the sum in H6.2(l according to the rank of the 
matrix M formed by the vectors cr^^^ , . . . , cr^^) . Assume that the rank of M is equal 
to M < fc. Reordering the vectors (t^^\ . . . , (t^'^\ if necessary, let us further assume 
that (T^^\ . . . , cr^") are linearly independent. With the help of (1), we then bound 
the probability on the right hand side of 16.21) by 

P(s(cr(j)) e [ai^^\bi^'^ for all .] = !,..., fc) 

< P(i?(^(^-)) e [al«, for aU J = 1, . . . , u) ^^^^^ 

■^'^ i=i 

To continue, we use the following two facts, proven, e.g., in IBCPOlj (see also 
Lemma 3.9 in jB(MN05j : 

(1) If cr(^) , . ■ . , (y^^^ are pairwise distinct, the rank of M is u < fc and <t(^\ . . . , 
cr(") are linearly independent, then n^(cr''^^\ . . . ,ct'^^^) = for at least one 

(2) Given u < k linearly independent vectors cr^^\ . . . ,<t^"\ there are at most 
2ii(fe-u) -^ays to choose (t^^\ . . . , cr^*^^ in such a way that the rank of M is 
u. 
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As a consequence, of (1), we have 

I max |n5(cr(i) , . . . , cr^")) - 2""n| > 2~''n. 
s 

Combined with Lemma (1) and the property (2) above, we conclude that there 
are at most 0(2""e~^ * ^ = 0(2""e~^ * ways to choose k pairwise dis- 

tinct configurations cr^^), . . . , crC^^ such that the rank of A/ is u. Using the fact 
that {\/n^„)^ = 0(n"e^""2~""), we immediately see that for < Cfcn and 
sufficiently small, the contribution of all configurations cr'-^\ . . . , cr^'^) such that the 
rank of M is smaller than k decays exponentially in n, so in particular it is o(l). □ 

Consider now a family of linearly independent configurations c'-^-', . . . , cr^'"'). We 
claim that for such a family and a„ < cn with c < \/2 log 2, we have 

P(s(cr(^)) e [ai^^\bi^^^] for all j = 1, . . . , fc) 

where a is the vector (a„,...,a„) G K'^ and C is the covariance matrix de- 
fined in To prove this approximation, we have to show that (x, C~^x) = 
{ot,C~^a) = o(l) whenever x is a vector with Xi = a„ -f 0(^„). This in turn 
requires an upper bound on the inverse of C. To prove such a bound we use that 
the matrix elements of C are bounded by n, while det C is bounded from below by 
n^'^. Using Cramer's rule, we conclude that the norm of C^^ is 0(71^'^^^), which 
in turn implies that {x,C~^x) = (a,C"^a) + 0{n'^''~^anin) + 0{n^^~^l'^). For 
a„ < cn with c < \/2 log 2, the error term is o(l), as desired. 

Assume that q;„ < cn with c < min{-y/2 log 2, 6^}. Using first the representation 
then Lemma Ffi. II and the bound 1)6. 6|l . and finally the explicit formula p.6() 
for we now approximate the fc*^ factorial moment as 



m m 



^ — ^ ^ — ^ (6 7) 

X2-- ^" 7^i^e-ll«ll^e-^(---«)(l + o(l))+o(l) 

where the sum goes over famihes of hnearly independent configurations <t^^\ . . . , 

6.3. Poisson convergence for an — o(n). In this section, we prove Theorem l3.6l 

To this end, we again choose A„ in such a way that a„ = o(A„-\/n), A„ = o{^Jn) and 
e~^"^^ decays faster than any power of n. Recalling Lemmas l3.4l fl') and (3), we may 
then restrict the sum in (|6.7|) to a sum over configurations with ^max ~ 0{Xn/^/n). 
Expanding the inverse of C as C^^^ = -^{Sij +0{q^^^)), we then approximate det C 
as detC = + o(l)), and {a,C-^a) as (a, C-Iq;) = \\a\\l + 0{n-^alql^^J. 
Using Lemma 13.41 and Lemma l6. II a second time to extend the sum over families of 
configurations back to a sum over all families of configurations in {— the 
proof of Theorem 13.61 is therefore reduced to the proof of the bound 

2-nk J2 e°(7r"'9.Lx) = 1 + 0(1). (6.8) 
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Since typical configurations lead to a maximal off-diagonal overlap gmax of order 
0(71^^/^), we expect that such a bound holds as long as = o(n). Lemma IV. II in 
Section 17?^ implies that this is indeed the case. 

6.4. Absence of Poisson convergence for faster growing «„• In this section, 
we prove 

Theorem EZI We start by proving the following lemma. 

Lemma 6.2. Given a positive integer k, there are constants > and Cfe < oo 
such that 

m m 

E[l[{Z^{ai,bij}k,] <Ckl[j^'+o{l) (6.9) 

(=1 e=i 

whenever an < ^kn, k = ki and n is large enough. 

Proof. Starting from the approximation (|6.7|l . we will again restrict ourselves to 
configurations obeying the condition H3.13|l . but this time we will choose A„ of the 
form A„ = \/2knek where e^. > will be chosen in such a way that < \/21og2 
and ik < Cfc. Note that our choice of A„ guarantees that for q;„ < ne„, the sum 
over families of configurations violating the condition (|3.13() decays exponentially 
with n. Note further that the condition (|3.13() with A„ = V2,knek guarantees that 
9max ^ 2^*''+^fce^. Expanding det C as det C ~ n'^{l + ©(g^^^)), we therefore have 
that det C > ^n'^ provided ik is chosen small enough. 

Let us finally write the matrix C as C = ^(Ifc + A), where Ij. is the identity 
matrix of size k, and A is the matrix with zero diagonal and off-diagonal entries 
(j2(cr(*),(T(j)). Expanding C^^ as C^^ = i(Ifc - A+ ^), we see that 

(a,C-ia)>l(||a||2-(a,yla)) > ^ - fc^^^^X) • (6-10) 

Together with our lower bound on det C, we see that the proof of the lemma now 
reduces to the bound 

2~nk J2 V2e 5^ < Cfe. (6.11) 

For ek small enough, this bound again follows from Lemma l7. II in Section l7.2l □ 

Note that Lemma 16.21 implies statement (ii) of Theorem 13.71 while the bound 
(|6.1|) implies statement (i). We are therefore left with the proof of (iii), i.e., the 
statement that limsup„_^Q^ E[(Z„(a, 6))2] > 7^ when limsupQ;„/n > 0. Recalling 
the representation, this in turn requires us to prove that 

limsup2-2« V" !^M/2 ^^"°"'^'^^°'^''"^ > ^- (6.12) 

Let q = (7(cr(^), cr'^)) be the off-diagonal overlap of the two configurations cr'-'^' and 
cr'^). Since C is now just a 2 x 2 matrix, both its inverse and its determinant can 
be easily calculated, giving det C = n^{l — q^), and, using that ||q:||2 = ka^ — 2a^, 
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As a consequence, we will have to prove a lower bound on the expression 

2-2" ^ ^_e^TT^. (6.14) 

Taking subsequence, if necessary, let us assume that an/n converges to some k, with 
< K < min{e2, \/2 log 2}. We then bound the sum ^ from below by restricting 
it to all configurations for which \q\ < 2k. Under this restriction, the summand is 
bounded from below by -^===^6''"'' , with /3„ ~ a^/{n{l + 4k2)). Observing that 
(j2 < 1 we may then use Lemma |3.4I (1) and (3) to extend the sum back to a sum 
which runs over all configurations <t'^' and (t^'^\ leading to the lower bound 

2-2" y" ^ r'^T^ > y e^"'Vo(l). (6.15) 

Let E2 denote expectations with respect to the uniform measure over families of 
configurations cr*^),^^^) g |_i^_|_i|"_ Using Holder and the fact that E2[(j2] = ^, 
we then lower bound the right hand side by 

9-2" 

y e^"^-+o(l) 



Vl - 16k 



4 

cr<l).cr<2) 



e^"/" 1 / ^2 . 

> , + 0(1) = , exp + 0(1). 

- VI - 16^4 VI - 16/^4 ^Vi + 4k2^ 



(6.16) 



For K sufficiently small, the right hand side is asymptotically larger than 1, proving 
the desired lower bound (|6.12() . which completes the proof of Theorem 13. 71 



7. Auxiliary results 

7.1. Proof of Lemma 13.51 

Proof of Lemma \3.5\ Wc will have to prove that 

2-"fe y e^t-'''.--**") = 1 + 0(1). (7.1) 

<t(1)....,ct('') 

Recalling the assumption \ f{(T^^\ . . . , cr''^')! < ca^^q^^^y^, let On = o{n) be a sequence 
of positive integers such that 6'„ ^ cx3 as 71 00 and a\9n ~ o{^/n). 

We now split the sum on the left hand side of H7.1(l into two parts: the sum 
over all families of configurations er'^', . . . ,(t'^'^^ that satisfy H3.13() with 0„ taking 
the place of A„ and the sum over the configurations that violate the bound H3.13II , 
again with 6„ replacing A„ . 

Consider the first sum. By Lemma 13.41 the number of terms in this sum is 

bounded between 2"'"'(1 — 2'^+^e^"5^) and 2"'"'. For all these configurations we know 
from Lemma lO that q,na.j, < 2''-^ and hence |/(cr(i), . . . ,(t('^))| < c2''al-^ = 
0(1) as n 00. Using the fact that 0„ — > 00 as n — > 00 we obtain that the 
contribution from the first summation is 1 + o(l) as n ^ 00. 

Thus, to establish Lemma 13.51 it suffices to show that the contribution from the 
second sum is o(l) as n — > 00. To this end wc partition the interval into 
subintervals [k^jK^+i] of length one. Now consider the families of configurations 
(t(^\ . . . ,cr*'') that satisfy H3.13|l when A„ is replaced by k^+i but violate it if A„ 
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is replaced by Ki. It is easy to see that for n large enough, the contribution from 
these terms is bounded by 

Adding up these error terms, we obtain that the second sum is of order ©(e-''"/^) = 
0(1). This completes the proof of Lemma [3.51 □ 

7.2. Proof of Lemma l4.H The proof of Lcmma HTI will be based on the following 
lemma. 

Lemma 7.1. Let k be a positive integer, and let [3 < be a non-negative real. 

If \g{cr'^^\ . . . , cr'-'^^)! < (3q^^^ for all families of configurations (T^^\ . . . , (t^''\ then 

fl-M^/!)<2-"- y e^('^'^-V^')<fl-fc(fc-l)^)"'' (7.2) 
\ 2 n/ ^-^ \ nl 

CT(l),...,cr('=) 

Proof. Let ]Efc[-] denote expectations with respect to the uniform measure over all 
families of configurations cr^-^\ . . . , cr'^'^^ e { — With the help of Jensen's 
inequality, we then immediately obtain the lower bound 



me'] > cxp{-(3Ek[qLJ) > cxp(-/3^Efe[<z, 



(7.3) 

/ ^fc(fc-l)l\ fc(fc-l)l 
\ 2 n/ 2 71 

To obtain an upper bound, we first use Holder's inequality to obtain the estimate 

Efc [eS] < Efe [e^ ] < ff Efc [e^'^"'^ ] < max Efc [e^'^"-^' ] (7.4) 
J- A i<j 

with K = k{k — l)/2. Next we rewrite the expectation on the right hand side as 

-1 POD 2 

Ekie^f^"'^] = -j= / e-'^Efc[e^^*-"]dx. (7.5) 

But now we can calculate the expectation exactly. Together with the inequality 
cosh(j/) < cxp(y^/2). this leads to the estimate 

Efc[e^'2^«'^"'] = (^cosh(^2i^/3x?i"i))" < exp^i^/Jx^n-^). 

Inserting this into H7.5|l and H7.4|l . we have 

E, \e9\ < J— r e^(i^2if/3«-i)4^^ ^ ^ = ^ 

\/2^7-oo - 2A'/3n-i ^l-/c(fc-l)/?n-i' 

the desired upper bound. □ 

Having established the above lemma, we are now ready to prove Lemma [4. II 

Proof of Lemma \^~l\ Let /i((t(^\ . . . , cr'^^^) ~ ^i^j 'lij a-^d f ~ fi + R, and define 

F{0) = Ekief'f], Fm = E,[e^/i], and F^iP) - Ek[eP% 
Since (by ^TH^ ) 

Fi(/3„) = c^^i^h^t^Apl + 0(/?3^-2)) , (7.6) 
we only have to prove that F{(3n) = Fi(/3„)e°(^'""')+°(^) whenever /?„ = o(n). 
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Let e„ = 2k{k — 1)C^ where C is the constant from Lemma l4.ll Applying 
Holder's inequality to Efe[e-'^] we obtain 



and hence 



Using equation (|7.6() one obtains 

+ e ))^ ^ exp(/3^„n-i + 0{f3l/n')) = ^MOWl/n')). 

Recalling that \R\ < Cq^^^^, we would like to use Lemma IV. II to bound the second 
factor. To this end, we need to guarantee that k{k— V)'^^^C^ < 1. By our choice 
of e„ this will be the case as soon as n is large enough to ensure that £„ < 1. For 
n large enough, we therefore have 

F2{Pn^)^ < f 1 - k{k i)i±£!icA.\-5nfe ^ exp(0(/3„/n)). 
Putting everything together, we get 



< cxp(0(/33/n2) ^ o{P,,n~')) = exp(0(/33/n2) + o(l)) . 
Applying Holder's inequality to Efe[e'^^+'^"^ obtain that 

V 1 + e„ / V Cn ^ 



This implies that 



F{f3n) ^ i^i((l+e„)-i/3„)i+^" 1 



Fi(/3„) - Fi(/3„) F2{-Pn^y-' 

Proceeding along similar lines as for the upper bound, we obtain that 

^>exp(o(/3^/n^) + o(l)). 

Combining the two applications of Holder's inequality we finally obtain that 

^^=exp(0(/3:^/n2) + a(l) 

This completes the proof of the lemma. □ 

8. Summary and Outlook 

8.1. Summary of Results. In this paper, we considered the local REM conjecture 
of Bauke, Franz and Mertens, for both the NPP and the SK spin glass model. For 
the Npp we showed that the local REM conjecture holds for energy scales Q!„ of 
order o(n^/^), and fails if a„ grows like nn^^^ with k > 0. For the SK model we 
established a similar threshold, showing that the local REM conjecture holds for 
energies of order o(n), and fails if the energies grow like ku with k > sufficiently 
small. 

Although we believe that the local REM conjecture also fails for still faster 
growing energy scales, our analysis did not allow us to make this rigorous since 
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we could not exclude that the moments of the energy spectrum diverge, while the 
spectrum itself undergoes a re-entrance transition and converges again to Poisson 
for faster growing energy scales. 

8.2. Proof Strategy. Before discussing possible extensions, let us recall our proof 
strategy. Both the proof of Poisson convergence, and the proof that Poisson con- 
vergence fails after a certain point, relied on a precise asymptotic analysis of the 
factorial moments. 

For the purposes of this discussion, let us restrict to the one-dimensional factorial 
moments E[(Z„(a„, 6„))fe] of the number of points in the energy spectrum between 
some a„ and 6„ in the vicinity of a„. We expressed these factorial moments in 
terms of the probability that the energies E{(t^^'>), . . . , i?(cr(*'')) of k pairwisc distinct 
configurations cr^^', . . . , cr^*^' all lie in the interval [a„, 6„], see (|3.11ll . 

The technical meat of our proof then consisted of two steps: a bound on the 
sum over "atypical configurations" (see Lemmas 13.41 and I6.1|l . and a proof that 
for typical configurations, the probability that the energies of cr^^), . . . , crC') all 
lie in the interval [a, b] is asymptotically equal to the product of the probabilities 

p(£;(o-« e 

For both the Npp and the SK model, the value of the threshold can already be 
understood by considering the factorization properties of typical configurations, see 
(|3.24|l . Taking, e.g., the case fc = 2, our results say that for typical configurations 
in the Npp with Gaussian noise, we have factorization up to a factor 

e""(9+'^(«'»(H-o(l)), (8.1) 

where q is the overlap between the two configurations. Since the overlap between 
typical configurations is of order n~^^'^, we obtain factorization if and only if ~ 
o{n^^^). To obtain the same result for the Npp with general distribution was much 
more work, since it required establishing a large deviation density estimate for k a 
priori highly dependent variables, but the heuristic for the threshold of n^^^ is still 
the same. 

By contrast, the threshold for the SK model is easier to establish than that for 
the Npp. The main reason is the restriction to Gaussian noise, which made the 
proof of a large deviation density estimate unnecessary, in addition to simplifying 
the proof of the bound on atypical configurations. But on a heuristic level, there is 
not much of a difference: now we obtain factorization up to multiplicative factor of 

e^(9'+o(«'»(l + o(l)), (8.2) 
giving the threshold of a„ = o(?i) for Poisson convergence. 

8.3. p-Spin models. In the physics literature, one often considers a generalization 
of the SK model which involves interactions between p different spins, instead of 
the two-body interaction of the standard SK model. For our purpose, these p-spin 
SK models are best defined as Gaussian fields indexed by the spin configurations 
(T e { — Recalling that a Gaussian field is uniquely defined by its mean 
and covariance matrix, we then define the p-spin SK-Hamihonian H'^p\(t) as the 
Gaussian field with mean and covariance matrix E[i?(P)(cr)if(P)(o-)] = nq{cT,a)P 
where p = 1,2,..., and q{-, •) is the overlap defined in 1)1. 2|l . Note that the energy 
of the Npp with Gaussian noise is nothing but the absolute value of the p = 1 SK 
Hamiltonian divided by y^. Up to a rescaling by y^, the energy spectrum of the 
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Npp with Gaussian noise is therefore identical to the positive energy spectrum of 
the p = 1 SK model. 

It was shown in |BK05a| that the local REM conjecture holds for p = 1 if 
a„ = 0(71'^/''"'^) for some e > 0, and for p > 2 if a„ = 0{n^~'^). For p ~ 1,2 our 
results establish a little bit more, namely the validity of the REM conjecture for 
a„ = 0(71'^/^) if p = 1, and for Q!„ = o{n) if p = 2. More importantly, our results 
prove that these are actually the thresholds for the validity of the REM conjecture. 

This raises the question about the true threshold for p > 3. Starting with the 
factorization properties for typical configurations, one easily sees that the joint 
density of iJ(cr(i) ),..., i7(cr('^')) is again given by a formula of the form l|3.32|l . 
where C is now the matrix with matrix element Cij — n{q{cr^'^\cr^^')))P . This then 
leads to factorization up to a multiplicative error term 

e#o(«S.ax)(i + o(l)), (8.3) 

where gmax is the maximal off-diagonal overlap of cr^^) , . . . , cr''^^ . If q:„ = 0{n) , this 
gives factorization for typical configurations as long as p > 2. But unfortunately, 
our control over atypical configurations is not good enough to allow a proof of the 
REM conjecture for energies that grow that fast. Indeed, it is easy to see that 
Lemma l6. II can be generalized to p > 2; but an application of the lemma to control 
the error for the k^^ factorial moment requires the condition < nej. where ek is 
a positive constant that goes to zero as /c — > 00. 

We therefore see that our methods can easily be used to prove the local REM 
conjecture for a„ = o(n^^'^) and p = 1 as well as q;„ — o(n) and all p > 2, but they 
are not strong enough to answer the question whether this is the actual threshold 
for p > 3; or whether the REM conjecture remains valid if p > 3 and q;„ grows like 
UK for some small k > 0. 

An even more challenging question is the question of what happens after the 
local REM conjecture fails. This question seems quite hard, and so far it has only 
been answered for the hierarchical GREM-modcl. For this model it has been shown 
|BK05bl IBK05c| that the suitably rescaled energy spectrum converges to a mixed 
Poisson process with density given in terms of a Poisson cascades on R^, where 
the dimension £ becomes larger and larger as k passes through an infinite series of 
thresholds, the first threshold being the value where the local REM conjecture fails 
for the GREM. 

8.4. A Simple Heuristic. Returning now to the Npp, we note that the rigorous 
moment analysis of the threshold is somewhat unintuitive and rather involved. 
Alternatively, let us present a simple heuristic to explain the threshold scale n^^'^. 
To this end it is useful to introduce the gauge invariant magnetization 



Af(cr) = - V,T,sgnX, 

n ^ — ^ 



and consider the joint distribution of M(cr) = ^JnM[(r) and the "signed" energy 
H{(7) introduced in (|3.7() with, as usual. Xi, . . . ,Xn chosen i.i.d. with density p, 
and cr chosen uniformly at random. 

The covariance matrix C of A/(cr) and H{(t) is easily calculated to be 



1 ^l 
M 1 
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where ^ = E[|Xi|] < 1, and its inverse C ^ is given by 



1 

-A^ 1 



Assuming, for the moment, that the joint density of M(cr) and -ff(cr) obeys a local 
limit theorem, we now approximate this density by the Gaussian density 

g{H,M) = } ^-|(g"CiY + M"C-/+2HMCi2) 

2tt\/ det C 



1 



, e W^)^" — (8.4) 

1 -iff2 3_^^(A/_uff)^ 

= p 2 e ^ ' 

27rVdetC 

In this approximation, the distribution of M conditioned on i7 = a is therefore 
Gaussian with mean a/j, and covariance 1 — /i^, implying in particular that the 
expectation of M is equal to a^/^/n. 

Consider now two configurations <y ^cr' ■, both chosen uniformly at random among 
all configurations with magnetization Then the expected overlap between a 
and cr' is A/2(l + o(l)). 

Finally, consider two configurations cr, cr' both chosen uniformly at random 
among all configurations with signed energies in the range 7? € [Q;,a + da], for 
some small da. Since the energies of cr and cr' are correlated, it follows that 
their magnetizations are also correlated. Under the assumptions that (1) the 
joint distribution g{H,M) obeys a local limit theorem, as in H8.4() . and (2) the 
only correlation between these configurations is due to the correlation in their 
magnetizations, it would follow that the expected overlap E[g(cr, cr')] is given by 
+ o(l)) = a2^Vn(l + o(l)). 

Note that the above heuristic focuses on the "signed energy" H{cr) rather than 
the true energy £'(cr) = \H{cr)\. Conditioning instead on E{cr), E{cr') S [a, a + da], 
the above heuristic therefore suggests a bimodal distribution of q{cr^ cr') with peaks 
at ±a2^V"(l + o(l)). 

Now recall that the local REM conjecture says that the overlap, rescaled by 
y/n, is asymptotically normal. However, our above heuristic says that y/nq{(T, cr') 
cannot be asymptotically normal for a growing like n^/^ or faster, in agreement 
with our rigorous results. Thus this heuristic correctly predicts the scale at which 
the REM conjecture breaks down, and suggests that the breakdown could be due 
to correlations in the magnetization of configurations with similar energies of scale 
Qj. greater. 

A detailed calculation, however, suggests that we exercise some caution in the 
application of this heuristic. Although the heuristic correctly predicts the scale of 
the threshold, and the double peak structure of the overlap at the threshold, it 
does not predict the correct position of the peaks when a = K'n}/^. Indeed, for this 
scaling, we rigorously showed that the rescaled overlap distribution converges to a 
convex combination of two Gaussians centered at ±k^, in contrast to the heuristic 
prediction of ±k^/z^. This, in turn, suggests that there are additional correlations 
besides those induced by the magnetization. 
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8.5. Algorithmic Consequences. Over twenty years ago, Karmarkar and Karp 
|KK82j gave a linear time algoritlim for a suboptimal solution of the number par- 
titioning problem. For i.i.d. weights with densities of the form studied in the 
current paper, the typical energy Ekk of the KK solution is of order n~^('°s"\ 
|KK82I IYak96j , while the minimal energy is known to be much smaller |KKLU86l 
ILue98[IHnF0T] . namely of order 2-^("). 

This raises the question of whether one can do better than the KK solution - a 
question which has received much study due to the numerous applications of the 
number partitioning problem. This work has led to many different heuristics, but to 
our knowledge no algorithm with guaranteed performance significantly better than 
KK has emerged. In the absence of a good global alternative to KK, one might try 
to base an improved solution of the problem on a local search algorithm starting 
from the KK solution. But such an approach is unlikely to produce better results, 
as the following argument shows. 

Consider the random Npp as defined in this paper, and let cr be a partition with 
energy E{cr) of the order of Ekk, i-c, E{cr) = 7j-^(iog"). Let o- be a local pertur- 
bation of cr, i.e., let cr be a configuration such that cr and a differ on a small subset 
K C {1, . . . ,7i}, with k ~ \K\ bounded uniformly in n. The signed energies H{(t) 
and H[a) then differ by rT^^^ /S.K{cr k), where ^KicrK) is the random variable 

and aK is the restriction of cr to K. Under mild assumptions on the probability 
density p of the weights Xi,. . . , Xn, the density of Ak{(Tk) is a continuous function 
near zero, and the probability that \Ak\ < e for some small e is of order 9(e) with 
the constants implicit in the 9 symbol depending only on k. 

Obviously, any local improvement algorithm that changes exactly k bits will lead 
to a change in the unsigned energies that is bounded from below by 

S[ ~ n~^^'^ min min |Ai<-(cr&-)|. 

K:\K\=k TK 

Taking into account that there are only (^) < n*"' possible choices for K, we see that 
the probability that (5^, ' < e is bounded by 0(en'^+^/^). We conclude that with high 
probability, (5^, ' is at least 6'(n~^/^~'''), much larger than the energy n~^'^'°s") of 
our starting configuration cr. Thus any local improvement algorithm that changes 
k bits moves us with high probability far away from the starting configuration with 
energy of order 

Note that this simple argument does not use very much; in particular, it is not 
related to REM conjecture, which suggests a much deeper reason for the apparent 
difficulty of the Npp. Indeed, applying our local REM Theorem to a = Ekk, it 
says that in the vicinity of Ekk, the energy behaves like a random cost function of 
2n-i independent random variables. If the energy of the Npp were truly a random 
cost function of 2"^^ independent random variables, this would imply that there 
is no algorithm faster than exhaustive search, implying a running time exponential 
in n. But of course, we have not come near to proving anything as strong as this. 

In fact, even on a non-rigorous level some caution is required when applying 
the above heuristic. Indeed, only n linear independent configurations cr'^', 
cr^") are needed to completely determine the random variables Xi, . . . , Xn from the 
energies E{cr^^'>), . . . , E{(t^^^), implying that the energy spectrum lies in a subspace 
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of dimension n, not 2". Still, our REM Theorem proves that the energy spectrum 
behaves locally like that of a random cost function, suggesting several possible 
conjectures. 

The first conjecture is best described in an oracle setting, where the oracle, O, 
keeps the n weights Xi , . . . , X„ secret from the algorithm A. The algorithm is 
given the KK-solution cr'-^^^ and its energy Ekk, and successively asks the oracle 
for the energy of some configurations (t^'^\ . . . , (t^"^\ where m is bounded. Given 
this information, the algorithm then calculates a new approximation cr ^ ±cr^^^K 
Given our REM Theorem, we conjecture that with high probability (tending to one 
as n ^ oo), the energy of the new configuration cr is much larger than Ekk', in 
fact, we conjecture that with high probability E{a) / Ekk —fOOBsn—t oo. 

The second conjecture gives the algorithm much more input, namely the ^ first 
energies above a threshold a and the configurations corresponding to these energies. 
The task of the algorithm is now to find a new partition a whose energy is as near 
possible to a, which we assume to grow only slowly with n (say like o(n~^/^), to stay 
in the realm of our Theorem l2.1(l . Again, the algorithm has no access to the original 
weights, but may ask the oracle O for the energies of m additional configurations 
cr'^^\ . . . , cr^'"', adapting the next question to the answer of the preceding ones. For 
m and (. bounded uniformly in n, we then conjecture that with high probability the 
algorithm produces a configuration cr with {E(a) — a)^^^ — > oo, while the actual 
value of the next configuration above a is with high probability equal to a + 0(Cn) 
by our Theorem 12. II 

One finally might want to consider the above conjectures in a setting where the 
oracle only reveals the relative order of the energies £^(cr(^^), . . . , E{cr^"^^), but keeps 
the numerical values of E{(t^^'>), . . . , E{cr^"^'>) secret. In such a setting, there is no 
a priori reason to assume that m < n. Instead, it seems reasonable to conjecture 
inapproximability results for values of m and i that are polynomial in n. 
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